llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-27 02:28:19 -04:00

Files

Georgi Gerganov 1f63e75f3b metal : use less stack memory in FA kernel (#14088 )

* metal : use less stack memory in FA kernel

ggml-ci

* cont : fix BF16 variant

2025-06-09 23:05:02 +03:00

2025-05-29 12:50:25 +02:00

2025-06-01 13:43:57 +03:00

2025-06-09 23:05:02 +03:00

.gitignore

…

CMakeLists.txt

2025-06-09 16:47:13 +02:00