This website requires JavaScript.
Explore
Help
Sign In
tqcq
/
llama.cpp
Watch
0
Star
0
Fork
0
You've already forked llama.cpp
mirror of
https://github.com/ggml-org/llama.cpp.git
synced
2025-07-19 09:08:04 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
5896c65232c7dc87d78426956b16f63fbf58dcf6
llama.cpp
/
examples
/
server
/
tests
/
unit
History
Xuan Son Nguyen
5896c65232
server : add OAI compat for /v1/completions (
#10974
)
...
* server : add OAI compat for /v1/completions * add test * add docs * better docs
2024-12-31 12:34:13 +01:00
..
test_basic.py
server : add flag to disable the web-ui (
#10762
) (
#10751
)
2024-12-10 18:22:34 +01:00
test_chat_completion.py
server : add OAI compat for /v1/completions (
#10974
)
2024-12-31 12:34:13 +01:00
test_completion.py
server : add OAI compat for /v1/completions (
#10974
)
2024-12-31 12:34:13 +01:00
test_ctx_shift.py
…
test_embedding.py
server : add support for "encoding_format": "base64" to the */embeddings endpoints (
#10967
)
2024-12-24 21:33:04 +01:00
test_infill.py
server : fix format_infill (
#10724
)
2024-12-08 23:04:29 +01:00
test_lora.py
…
test_rerank.py
server : fill usage info in embeddings and rerank responses (
#10852
)
2024-12-17 18:00:24 +02:00
test_security.py
…
test_slot_save.py
…
test_speculative.py
server : fix speculative decoding with context shift (
#10641
)
2024-12-04 22:38:20 +02:00
test_tokenize.py
…