llama.cpp/unit at 5ad021f9244023cbd3ac4e58902d5724184b79d1 - llama.cpp - Cat's Mantra

tqcq/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-27 02:28:19 -04:00

Files

History

Sigbjørn Skjæret ddef99522d server : fix assistant prefilling when content is an array (#14360 )

2025-07-05 09:17:14 +02:00

..

test_basic.py

llama : move end-user examples to tools directory (#13249 )

2025-05-02 20:27:13 +02:00

test_chat_completion.py

server : fix assistant prefilling when content is an array (#14360 )

2025-07-05 09:17:14 +02:00

test_completion.py

server: fix regression on streamed non-chat completion w/ stops (#13785 )

2025-05-26 14:16:37 +01:00

test_ctx_shift.py

server : do not return error out of context (with ctx shift disabled) (#13577 )

2025-05-16 21:50:00 +02:00

test_embedding.py

llama : move end-user examples to tools directory (#13249 )

2025-05-02 20:27:13 +02:00

test_infill.py

llama : move end-user examples to tools directory (#13249 )

2025-05-02 20:27:13 +02:00

test_lora.py

…

test_rerank.py

llama : move end-user examples to tools directory (#13249 )

2025-05-02 20:27:13 +02:00

test_security.py

llama : move end-user examples to tools directory (#13249 )

2025-05-02 20:27:13 +02:00

test_slot_save.py

llama : move end-user examples to tools directory (#13249 )

2025-05-02 20:27:13 +02:00

test_speculative.py

llama : move end-user examples to tools directory (#13249 )

2025-05-02 20:27:13 +02:00

test_template.py

server: add --reasoning-budget 0 to disable thinking (incl. qwen3 w/ enable_thinking:false) (#13771 )

2025-05-26 00:30:51 +01:00

test_tokenize.py

llama : move end-user examples to tools directory (#13249 )

2025-05-02 20:27:13 +02:00

test_tool_call.py

server: update deepseek reasoning format (pass reasoning_content as diffs) (#13933 )

2025-06-02 10:15:44 -07:00

test_vision_api.py

server : support audio input (#13714 )

2025-05-23 11:03:47 +02:00