Logo
Explore Help
Sign In
tqcq/llama.cpp
0
0
Fork 0
You've already forked llama.cpp
mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-31 22:53:52 -04:00
Code Issues Packages Projects Releases Wiki Activity
Files
ef1e345c85561b63ba9c791b38e76d3f0df5f2bb
llama.cpp/ggml
History
Francis Couture-Harpin ef1e345c85 ggml-quants : Q2_2 now faster than Q4_K on with AVX2
2024-06-27 02:06:28 -04:00
..
cmake
llama : reorganize source code + improve CMake (#8006)
2024-06-26 18:33:02 +03:00
include
ggml-quants : 1.625 bpw ternary packing for BitNet 1.58b
2024-06-27 02:06:22 -04:00
src
ggml-quants : Q2_2 now faster than Q4_K on with AVX2
2024-06-27 02:06:28 -04:00
CMakeLists.txt
ggml : add GGML_CUDA_USE_GRAPHS option, restore GGML_CUDA_FORCE_CUBLAS (cmake) (#8140)
2024-06-26 21:34:14 +02:00
ggml_vk_generate_shaders.py
llama : reorganize source code + improve CMake (#8006)
2024-06-26 18:33:02 +03:00
Powered by Gitea Version: 1.24.3 Page: 1565ms Template: 141ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API