llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-09-26 01:00:15 -04:00

Files

Jeff Bolz ba1ceb3456 vulkan: fix noncontig check for mat_mul_id splitting (#14683 )

* vulkan: fix noncontig check for mat_mul_id splitting

Remove supports_op check for > 4096 (splitting fixes this)

* vulkan: fix batched matmul dequant for Q*_K

2025-07-15 21:51:09 +02:00

cmake

ggml-cpu : rework weak alias on apple targets (#14146 )

2025-06-16 13:54:15 +08:00

include

ggml : add ggml_scale_bias (#14417 )

2025-07-09 18:16:12 +02:00

src

vulkan: fix noncontig check for mat_mul_id splitting (#14683 )

2025-07-15 21:51:09 +02:00

.gitignore

…

CMakeLists.txt

ggml : remove kompute backend (#14501 )

2025-07-03 07:48:32 +03:00