This website requires JavaScript.
Explore
Help
Sign In
tqcq
/
llama.cpp
Watch
0
Star
0
Fork
0
You've already forked llama.cpp
mirror of
https://github.com/ggml-org/llama.cpp.git
synced
2025-07-30 22:23:31 -04:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
b7c11d36e605b35206901d0e21905f1b99508e33
llama.cpp
/
ggml
History
Johannes Gäßler
69c487f4ed
CUDA: MMQ code deduplication + iquant support (
#8495
)
...
* CUDA: MMQ code deduplication + iquant support * 1 less parallel job for CI build
2024-07-20 22:25:26 +02:00
..
cmake
…
include
CUDA: fix partial offloading for ne0 % 256 != 0 (
#8572
)
2024-07-18 23:48:47 +02:00
src
CUDA: MMQ code deduplication + iquant support (
#8495
)
2024-07-20 22:25:26 +02:00
.gitignore
…
CMakeLists.txt
cmake : install all ggml public headers (
#8480
)
2024-07-18 17:47:12 +03:00