This website requires JavaScript.
Explore
Help
Sign In
tqcq
/
llama.cpp
Watch
0
Star
0
Fork
0
You've already forked llama.cpp
mirror of
https://github.com/ggml-org/llama.cpp.git
synced
2025-07-09 13:02:12 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
bec2183f2c8d37cf1278c11d1adb9311e9eaa242
llama.cpp
/
examples
/
server
/
tests
/
unit
History
Georgi Gerganov
e6e7c75d94
server : fix extra BOS in infill endpoint (
#11106
)
...
* server : fix extra BOS in infill endpoing ggml-ci * server : update infill tests
2025-01-06 15:36:08 +02:00
..
test_basic.py
server : add flag to disable the web-ui (
#10762
) (
#10751
)
2024-12-10 18:22:34 +01:00
test_chat_completion.py
server : clean up built-in template detection (
#11026
)
2024-12-31 15:22:01 +01:00
test_completion.py
server : add OAI compat for /v1/completions (
#10974
)
2024-12-31 12:34:13 +01:00
test_ctx_shift.py
…
test_embedding.py
server : add support for "encoding_format": "base64" to the */embeddings endpoints (
#10967
)
2024-12-24 21:33:04 +01:00
test_infill.py
server : fix extra BOS in infill endpoint (
#11106
)
2025-01-06 15:36:08 +02:00
test_lora.py
server : allow using LoRA adapters per-request (
#10994
)
2025-01-02 15:05:18 +01:00
test_rerank.py
server : fill usage info in embeddings and rerank responses (
#10852
)
2024-12-17 18:00:24 +02:00
test_security.py
…
test_slot_save.py
…
test_speculative.py
server : allow using LoRA adapters per-request (
#10994
)
2025-01-02 15:05:18 +01:00
test_tokenize.py
…