tqcq/llama.cpp
Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2025-07-05 02:23:54 +00:00
e63520f37ac3fe55c1e25adc3be7ae9d5ad90dcb
llama.cpp/examples/server/tests/unit
Latest commit: a57bb94e29 by ochafik, "Update test_chat_completion.py", 2025-01-13 23:18:03 +00:00
test_basic.py            server : add flag to disable the web-ui (#10762) (#10751)    2024-12-10 18:22:34 +01:00
test_chat_completion.py  Update test_chat_completion.py    2025-01-13 23:18:03 +00:00
test_completion.py       server : add OAI compat for /v1/completions (#10974)    2024-12-31 12:34:13 +01:00
test_ctx_shift.py        …
test_embedding.py        server : add support for "encoding_format": "base64" to the */embeddings endpoints (#10967)    2024-12-24 21:33:04 +01:00
test_infill.py           server : fix extra BOS in infill endpoint (#11106)    2025-01-06 15:36:08 +02:00
test_lora.py             server : allow using LoRA adapters per-request (#10994)    2025-01-02 15:05:18 +01:00
test_rerank.py           server : fill usage info in embeddings and rerank responses (#10852)    2024-12-17 18:00:24 +02:00
test_security.py         …
test_slot_save.py        …
test_speculative.py      server : allow using LoRA adapters per-request (#10994)    2025-01-02 15:05:18 +01:00
test_tokenize.py         …