tqcq/llama.cpp
Mirror of https://github.com/ggml-org/llama.cpp.git (synced 2025-08-28 11:08:19 -04:00)
llama.cpp/tools at commit e8f8c2c71189e9074b810cb14376c413a1eadcd0
Latest commit: e8f8c2c711, Sigbjørn Skjæret: "fix assistant prefilling when content is an array" (2025-06-24 12:01:37 +02:00)
| Name | Last commit | Date |
| --- | --- | --- |
| batched-bench | llama : deprecate llama_kv_self_ API (#14030) | 2025-06-06 14:11:15 +03:00 |
| cvector-generator | llama : deprecate llama_kv_self_ API (#14030) | 2025-06-06 14:11:15 +03:00 |
| export-lora | … | |
| gguf-split | … | |
| imatrix | llama : deprecate llama_kv_self_ API (#14030) | 2025-06-06 14:11:15 +03:00 |
| llama-bench | llama-bench : add --no-warmup flag (#14224) (#14270) | 2025-06-19 12:24:12 +02:00 |
| main | main : honor --verbose-prompt on interactive prompts (#14350) | 2025-06-24 09:31:00 +02:00 |
| mtmd | mtmd : fix Pixtral OOM with large images by capping image_size to 1024 (#14326) | 2025-06-22 14:44:57 +02:00 |
| perplexity | llama : deprecate llama_kv_self_ API (#14030) | 2025-06-06 14:11:15 +03:00 |
| quantize | quantize : handle user-defined pruning of whole layers (blocks) (#13037) | 2025-06-22 23:16:26 +02:00 |
| rpc | rpc : Fix build on OpenBSD (#13541) | 2025-05-25 15:35:53 +03:00 |
| run | run : avoid double tokenization (#14327) | 2025-06-23 01:28:06 +08:00 |
| server | fix assistant prefilling when content is an array | 2025-06-24 12:01:37 +02:00 |
| tokenize | … | |
| tts | sync : vendor (#13901) | 2025-05-30 16:25:45 +03:00 |
| CMakeLists.txt | mtmd : rename llava directory to mtmd (#13311) | 2025-05-05 16:02:55 +02:00 |
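Several of the commits above change user-facing CLI behavior. The llama-bench entry adds a `--no-warmup` flag (#14270), which skips the warmup pass normally run before timing. A minimal sketch of an invocation, assuming a built `llama-bench` binary and a placeholder model path:

```sh
# Benchmark a model without the usual warmup pass (flag added in #14270).
# model.gguf is a placeholder path; substitute your own GGUF file.
./llama-bench -m model.gguf --no-warmup
```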
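Likewise, the main entry (#14350) makes `--verbose-prompt` apply to prompts typed at the interactive console, not only the initial prompt. A hedged sketch, assuming the `llama-cli` binary that `tools/main` builds into:

```sh
# Start an interactive session; with #14350, prompts entered interactively
# are also echoed as tokens when --verbose-prompt is set.
./llama-cli -m model.gguf -i --verbose-prompt
```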
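The quantize entry (#13037) adds user-defined pruning of whole layers (blocks) during quantization. The flag name below is an assumption inferred from the PR title, not confirmed by this listing; check `llama-quantize --help` in your build:

```sh
# Quantize to Q4_K_M while dropping layers 20, 21 and 22 entirely.
# --prune-layers is assumed from PR #13037; verify against --help.
./llama-quantize --prune-layers 20,21,22 model-f16.gguf model-q4.gguf Q4_K_M
```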