Logo
Explore Help
Sign In
tqcq/llama.cpp
0
0
Fork 0
You've already forked llama.cpp
mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-14 20:29:41 -04:00
Code Issues Packages Projects Releases Wiki Activity
Files
e121edc4324a640be11b7e567edd39b721b0f8e4
llama.cpp/tools/server/tests/unit
History
Olivier Chafik e121edc432 server: add --reasoning-budget 0 to disable thinking (incl. qwen3 w/ enable_thinking:false) (#13771)
---------

Co-authored-by: ochafik <ochafik@google.com>
Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com>
2025-05-26 00:30:51 +01:00
..
test_basic.py
…
test_chat_completion.py
server: streaming of tool calls and thoughts when --jinja is on (#12379)
2025-05-25 01:48:08 +01:00
test_completion.py
server : fix cache_tokens bug with no cache_prompt (#13533)
2025-05-14 13:35:07 +02:00
test_ctx_shift.py
server : do not return error out of context (with ctx shift disabled) (#13577)
2025-05-16 21:50:00 +02:00
test_embedding.py
…
test_infill.py
…
test_lora.py
…
test_rerank.py
…
test_security.py
…
test_slot_save.py
…
test_speculative.py
…
test_template.py
server: add --reasoning-budget 0 to disable thinking (incl. qwen3 w/ enable_thinking:false) (#13771)
2025-05-26 00:30:51 +01:00
test_tokenize.py
…
test_tool_call.py
server: streaming of tool calls and thoughts when --jinja is on (#12379)
2025-05-25 01:48:08 +01:00
test_vision_api.py
server : support audio input (#13714)
2025-05-23 11:03:47 +02:00
Powered by Gitea Version: 1.24.4 Page: 1423ms Template: 20ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API