llama.cpp/ggml at fa62da9b2dc03539a30f1306f59c4c6ffbe4f50a - llama.cpp - Cat's Mantra

tqcq/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-28 13:20:27 -04:00

Johannes Gäßler fa62da9b2d CUDA: support for mat. mul. with ne03 != ne13 (#11656)

2025-02-05 08:58:31 +01:00

cmake: add ggml find package (#11369)

2025-01-26 12:07:48 -04:00

CUDA: use mma PTX instructions for FlashAttention (#11583)

2025-02-02 19:31:09 +01:00

CUDA: support for mat. mul. with ne03 != ne13 (#11656)

2025-02-05 08:58:31 +01:00

.gitignore

…

CMakeLists.txt

cmake: Add ability to pass in GGML_BUILD_NUMBER (ggml/1096)

2025-02-04 12:59:15 +02:00