llama.cpp/unit at e63520f37ac3fe55c1e25adc3be7ae9d5ad90dcb

tqcq/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-05 02:23:54 +00:00

ochafik a57bb94e29 Update test_chat_completion.py

2025-01-13 23:18:03 +00:00

test_basic.py

server : add flag to disable the web-ui (#10762) (#10751)

2024-12-10 18:22:34 +01:00

test_chat_completion.py

Update test_chat_completion.py

2025-01-13 23:18:03 +00:00

test_completion.py

server : add OAI compat for /v1/completions (#10974)

2024-12-31 12:34:13 +01:00

test_ctx_shift.py

…

test_embedding.py

server : add support for "encoding_format": "base64" to the */embeddings endpoints (#10967)

2024-12-24 21:33:04 +01:00

test_infill.py

server : fix extra BOS in infill endpoint (#11106)

2025-01-06 15:36:08 +02:00

test_lora.py

server : allow using LoRA adapters per-request (#10994)

2025-01-02 15:05:18 +01:00

test_rerank.py

server : fill usage info in embeddings and rerank responses (#10852)

2024-12-17 18:00:24 +02:00

test_security.py

…

test_slot_save.py

…

test_speculative.py

server : allow using LoRA adapters per-request (#10994)

2025-01-02 15:05:18 +01:00

test_tokenize.py

…