llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-05 00:25:26 -04:00

Files

Georgi Gerganov e6e7c75d94 server : fix extra BOS in infill endpoint (#11106)

* server : fix extra BOS in infill endpoint

ggml-ci

* server : update infill tests

2025-01-06 15:36:08 +02:00

test_basic.py

2024-12-10 18:22:34 +01:00

test_chat_completion.py

2024-12-31 15:22:01 +01:00

test_completion.py

2024-12-31 12:34:13 +01:00

test_ctx_shift.py

…

test_embedding.py

2024-12-24 21:33:04 +01:00

test_infill.py

2025-01-06 15:36:08 +02:00

test_lora.py

2025-01-02 15:05:18 +01:00

test_rerank.py

2024-12-17 18:00:24 +02:00

test_security.py

…

test_slot_save.py

…

test_speculative.py

2025-01-02 15:05:18 +01:00

test_tokenize.py

…