Logo
Explore Help
Sign In
tqcq/llama.cpp
0
0
Fork 0
You've already forked llama.cpp
mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-28 13:20:27 -04:00
Code Issues Packages Projects Releases Wiki Activity
Files
8b5e19aea6ce9fe4452598663924373234041440
llama.cpp/tools
History
Isaac McFadyen 6a2bc8bfb7 server : added --no-prefill-assistant flag (#13608)
* added no-prefill-assistant flag

* reworded documentation comment

* updated server README.md
2025-05-17 23:59:48 +02:00
..
batched-bench
batched-bench : fix pp batch contents (#13492)
2025-05-13 18:01:53 +03:00
cvector-generator
…
export-lora
…
gguf-split
…
imatrix
imatrix : Add --parse-special for enabling parsing of special tokens in imatrix calculation (#13389)
2025-05-09 11:53:58 +02:00
llama-bench
llama-bench : fix -ot with dl backends (#13563)
2025-05-15 15:46:55 +02:00
main
llama : do not crash if there is no CPU backend (#13395)
2025-05-09 13:02:07 +02:00
mtmd
clip : clip.h become private API (⚠️ breaking change) (#13510)
2025-05-13 17:07:21 +02:00
perplexity
context : remove logits_all flag (#13284)
2025-05-08 14:26:50 +03:00
quantize
quantize : improve tensor-type pattern matching (#13033)
2025-05-13 19:12:31 +02:00
rpc
llama : do not crash if there is no CPU backend (#13395)
2025-05-09 13:02:07 +02:00
run
llama-run: add support for downloading models from ModelScope (#13370)
2025-05-09 10:25:50 +01:00
server
server : added --no-prefill-assistant flag (#13608)
2025-05-17 23:59:48 +02:00
tokenize
…
tts
…
CMakeLists.txt
mtmd : rename llava directory to mtmd (#13311)
2025-05-05 16:02:55 +02:00
Powered by Gitea Version: 1.24.3 Page: 1665ms Template: 162ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API