Logo
Explore Help
Sign In
tqcq/llama.cpp
0
0
Fork 0
You've already forked llama.cpp
mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-19 09:08:04 +00:00
Code Issues Packages Projects Releases Wiki Activity
Files
3600cc2886956fc0a07ef6ad2f4128ccfdbc8c6f
llama.cpp/tools
History
Georgi Gerganov 3600cc2886 llama : use n_swa + n_ubatch cells for SWA cache (#13833)
* llama : use n_swa + n_ubatch cells for SWA cache

ggml-ci

* llama : add warning about multi-sqeuence SWA contexts
2025-05-31 15:57:44 +03:00
..
batched-bench
batched-bench : fix pp batch contents (#13492)
2025-05-13 18:01:53 +03:00
cvector-generator
…
export-lora
…
gguf-split
…
imatrix
imatrix : Add --parse-special for enabling parsing of special tokens in imatrix calculation (#13389)
2025-05-09 11:53:58 +02:00
llama-bench
kv-cache : add SWA support (#13194)
2025-05-20 08:05:46 +03:00
main
llama : do not crash if there is no CPU backend (#13395)
2025-05-09 13:02:07 +02:00
mtmd
mtmd : drop _shared from libmtmd name, merge helpers into libmtmd (⚠️ breaking change) (#13917)
2025-05-31 10:14:29 +02:00
perplexity
context : remove logits_all flag (#13284)
2025-05-08 14:26:50 +03:00
quantize
quantize : improve tensor-type pattern matching (#13033)
2025-05-13 19:12:31 +02:00
rpc
rpc : Fix build on OpenBSD (#13541)
2025-05-25 15:35:53 +03:00
run
sync : vendor (#13901)
2025-05-30 16:25:45 +03:00
server
llama : use n_swa + n_ubatch cells for SWA cache (#13833)
2025-05-31 15:57:44 +03:00
tokenize
…
tts
sync : vendor (#13901)
2025-05-30 16:25:45 +03:00
CMakeLists.txt
mtmd : rename llava directory to mtmd (#13311)
2025-05-05 16:02:55 +02:00
Powered by Gitea Version: 1.24.2 Page: 2505ms Template: 124ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API