ggml : refactor llamafile_sgemm PPC code (#14673)

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-28 13:20:27 -04:00

Remove un-necessary templates from class definition and packing functions
Reduce deeply nested conditionals, if-else switching in mnapck function
Replace repetitive code with inline functions in Packing functions

2 ~ 7% improvement in Q8 Model
15 ~ 50% improvement in Q4 Model

Signed-off-by: Shalini Salomi Bodapati <Shalini.Salomi.Bodapati@ibm.com>

This commit is contained in:

shalinib-ibm

2025-07-14 18:46:42 +05:30

committed by

GitHub

parent 9c9e4fc635

commit 55c509daf5

1 changed files with 340 additions and 1091 deletions

1431

ggml/src/ggml-cpu/llamafile/sgemm.cpp

View File

File diff suppressed because it is too large Load Diff

ggml : refactor llamafile_sgemm PPC code (#14673)

1431 ggml/src/ggml-cpu/llamafile/sgemm.cpp View File

1431

ggml/src/ggml-cpu/llamafile/sgemm.cpp

View File