llama.cpp

Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2025-08-17 21:51:27 -04:00

Files

Georgi Gerganov 5783ae4359 metal : batch rows copy in a single threadgroup (#14384)

* metal : batch rows copy in a single threadgroup

ggml-ci

* metal : handle some edge cases when threadgroup size is not a power of 2

ggml-ci

2025-06-26 15:50:15 +03:00

.gitignore

…

CMakeLists.txt

2025-06-25 23:49:04 +02:00