llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-30 22:23:31 -04:00

Files

Johannes Gäßler 69c487f4ed CUDA: MMQ code deduplication + iquant support (#8495)

* CUDA: MMQ code deduplication + iquant support

* 1 less parallel job for CI build

2024-07-20 22:25:26 +02:00

…

2024-07-18 23:48:47 +02:00

2024-07-20 22:25:26 +02:00

.gitignore

…

CMakeLists.txt

2024-07-18 17:47:12 +03:00