Commit Graph

3 Commits

Author SHA1 Message Date
Eve
e536426ded llamafile : disable sgemm for batch-size 1 (#9330) 2024-09-07 22:02:26 +03:00
Srihari-mcw
ea5d7478b1 sgemm : improved Q4_0 and Q8_0 performance via 4xN and Mx4 gemm (#8908) 2024-08-31 11:20:35 +03:00
Georgi Gerganov
6b2a849d1f ggml : move sgemm sources to llamafile subfolder (#8394)
ggml-ci
2024-07-10 15:23:29 +03:00