Commit Graph

5817 Commits

Author SHA1 Message Date
f71b21d2f7 ggml-cpu: dedup ggml_table_f32_f16 from simd-mappings.h
we rely on the variable declaration in ggml-cpu.c instead

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-26 00:09:14 +08:00
5f2a09a8f6 ggml-cpu: extern c ggml_table_f32_f16 + chore docs
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-25 23:29:08 +08:00
6cebee25d0 ggml: move ggml_table_f32_f16 to ggml-cpu.c
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-25 22:33:12 +08:00
59b48e4800 ggml: move ggml_table_f32_f16 to ggml-cpu
ref: https://github.com/ggml-org/llama.cpp/pull/14317#discussion_r2164775006

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
(cherry picked from commit 9e40d984ad)
2025-06-25 22:29:27 +08:00
5be39c1143 Revert "ggml: move ggml_table_f32_f16 to ggml-cpu"
This reverts commit 9e40d984ad.

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-25 17:00:20 +08:00
827fce9cf8 Revert "ggml-cpu: move ggml_table_f32_f16 back to ggml-base due to ci failures"
This reverts commit 32a3533564.

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-25 17:00:18 +08:00
32a3533564 ggml-cpu: move ggml_table_f32_f16 back to ggml-base due to ci failures
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-25 16:37:48 +08:00
9e40d984ad ggml: move ggml_table_f32_f16 to ggml-cpu
ref: https://github.com/ggml-org/llama.cpp/pull/14317#discussion_r2164775006

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-25 14:58:25 +08:00
1b23fec005 ggml-cpu: remove mistaken fallback macro
fallback logic was already implemented but i was too sleepy to realise

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-25 01:31:53 +08:00
a02b360f2c ggml-cpu: rename all fp16<->fp32 macros to prefix with ggml_cpu
ref: https://github.com/ggml-org/llama.cpp/pull/14317#discussion_r2164449406

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-25 01:07:58 +08:00
64568ffb2d ggml: remove dependency on ggml-cpu from ggml-base
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-24 23:09:15 +08:00
1e6ebb2b1b ggml-cpu: fix wrong refactor of ggml-base
ref: https://github.com/ggml-org/llama.cpp/pull/14317#discussion_r2164176555

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-24 22:56:05 +08:00
e4a7f84d37 ggml-cpu: move nnpa together with other fp16<->fp32 simd
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-24 22:31:39 +08:00
e4666f93d3 ggml-cpu: attempt at fixing loongarch failing build
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-24 22:29:08 +08:00
3c055a421e ggml-cpu: fix amx mmq missing simd-mappings.h
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-24 22:27:11 +08:00
e615f73b02 ggml-cpu: fix missing simd-mappings.h within repack
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-24 21:29:15 +08:00
0367b803e9 ggml-cpu: fix missing simd-mappings.h import in quants.c
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-24 21:09:47 +08:00
17b032fab8 ggml: refactor fp16<->fp32 simd to ggml-cpu
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-24 20:42:15 +08:00
8a5e011cb5 Revert "ggml: refactor fp32->fp16 and fp16->fp32 simd to ggml-cpu"
This reverts commit bd288e8fa5.

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-24 15:54:33 +08:00
e73413bb98 Revert "ggml-cpu: fix duplicate func names during compile"
This reverts commit fbb733451f.

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-24 15:54:18 +08:00
fbb733451f ggml-cpu: fix duplicate func names during compile
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-24 15:18:20 +08:00
4d136cb6a1 docs: update broken huggingface link for s390x
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-24 15:11:58 +08:00
bd288e8fa5 ggml: refactor fp32->fp16 and fp16->fp32 simd to ggml-cpu
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-24 15:05:46 +08:00
5834dee1fc ggml-cpu: move nnpa fp16->fp32 and fp32->fp16 to simd-mappings
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-23 17:52:28 +08:00
5004e4395b ggml-cpu: remove unnecessary target compile definitions
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 23:37:44 +08:00
489cdf44bf ggml-cpu: clarify naming of dlf16
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 23:34:42 +08:00
07de57c69a ggml-cpu: add todo comment for future reference
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 21:07:30 +08:00
72965ea8b0 ggml-cpu: add ggml-impl.h future notes
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 21:06:57 +08:00
46227c61c9 ggml-cpu: remove typedef from cmakelists
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 21:02:05 +08:00
1b4dbf477c ggml-cpu: remove typedefs.h
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 21:01:21 +08:00
5c9b083511 Revert "ggml-cpu: move s390x typedef to own header file"
This reverts commit 18d79e1a30.

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 20:59:04 +08:00
e43dc82a21 ggml-cpu: undo cmakelists work
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 20:56:46 +08:00
3ec0bdc1df ggml-cpu: bring back compile definitions
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 20:47:05 +08:00
ebb8489a0c ggml-cpu: add s390x detection in ggml-src
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 20:38:55 +08:00
c8b3b89548 ggml-cpu: add compiler error macro
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 20:17:21 +08:00
04a395ea73 ggml-cpu: switch to quotes for import
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 20:11:51 +08:00
263b820b42 ggml-cpu: bring back compile macros
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 20:09:04 +08:00
781c263722 ggml-cpu: move things around
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 19:41:01 +08:00
18d79e1a30 ggml-cpu: move s390x typedef to own header file
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
(cherry picked from commit 157f856c34)
2025-06-21 19:31:34 +08:00
ba3513e44b ggml-cpu: switch to private macros
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 19:27:55 +08:00
a91c3ab6b0 ggml-cpu: add ggml-impl.h to cmakelists
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 19:19:11 +08:00
72c91436f6 ggml-cpu: move macro definitions
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 19:16:40 +08:00
84593387a7 ggml-cpu: bruteforce macro definitions
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 19:14:31 +08:00
ed76ff6e42 ggml-cpu: add debug prints
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 19:11:59 +08:00
fadc138763 ggml-cpu: test more macros
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 19:10:39 +08:00
1cacdd9a36 ggml-cpu: fix macro declaration
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 19:08:48 +08:00
3004a79f4b ggml-cpu: switch to importing ggml-cpu-impl instead
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 19:04:09 +08:00
48df977079 Revert "ggml-cpu: move s390x typedef to own header file"
This reverts commit 157f856c34.

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 19:03:09 +08:00
157f856c34 ggml-cpu: move s390x typedef to own header file
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 19:00:20 +08:00
e7910fc975 ggml-cpu: update macro tests
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-21 18:43:43 +08:00