llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-06-28 20:25:20 +00:00

Files

Xuan Son Nguyen f30f099228 server : implement cancellable request (#11285)

* server : implement cancellable request

* fix typo

* httplib 0.18.5

* fix i underflow

2025-01-18 14:12:05 +01:00

test_basic.py            2024-12-10 18:22:34 +01:00
test_chat_completion.py  2024-12-31 15:22:01 +01:00
test_completion.py       2025-01-18 14:12:05 +01:00
test_ctx_shift.py        2024-11-26 16:20:18 +01:00
test_embedding.py        2024-12-24 21:33:04 +01:00
test_infill.py           2025-01-06 15:36:08 +02:00
test_lora.py             2025-01-02 15:05:18 +01:00
test_rerank.py           2024-12-17 18:00:24 +02:00
test_security.py         2024-11-26 16:20:18 +01:00
test_slot_save.py        2024-11-26 16:20:18 +01:00
test_speculative.py      2025-01-02 15:05:18 +01:00
test_tokenize.py         2024-11-26 16:20:18 +01:00