llama.cpp/unit at 06b5159560de404c018026099bdc636f4d2930c6 - llama.cpp - Cat's Mantra

tqcq/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-05 08:28:37 -04:00

Files

History

ochafik e5113e8d74 Add --jinja and --chat-template-file flags

2024-12-30 03:50:51 +00:00

..

test_basic.py

server : add flag to disable the web-ui (#10762 ) (#10751 )

2024-12-10 18:22:34 +01:00

test_chat_completion.py

Add --jinja and --chat-template-file flags

2024-12-30 03:50:51 +00:00

test_completion.py

ggml : more perfo with llamafile tinyblas on x86_64 (#10714 )

2024-12-24 18:54:49 +01:00

test_ctx_shift.py

…

test_embedding.py

server : add support for "encoding_format": "base64" to the */embeddings endpoints (#10967 )

2024-12-24 21:33:04 +01:00

test_infill.py

server : fix format_infill (#10724 )

2024-12-08 23:04:29 +01:00

test_lora.py

…

test_rerank.py

server : fill usage info in embeddings and rerank responses (#10852 )

2024-12-17 18:00:24 +02:00

test_security.py

…

test_slot_save.py

…

test_speculative.py

server : fix speculative decoding with context shift (#10641 )

2024-12-04 22:38:20 +02:00

test_tokenize.py

…