This website requires JavaScript.
Explore
Help
Sign In
tqcq
/
llama.cpp
Watch
0
Star
0
Fork
0
You've already forked llama.cpp
mirror of
https://github.com/ggml-org/llama.cpp.git
synced
2025-08-05 08:28:37 -04:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
06b5159560de404c018026099bdc636f4d2930c6
llama.cpp
/
examples
/
server
/
tests
/
unit
History
ochafik
e5113e8d74
Add --jinja and --chat-template-file flags
2024-12-30 03:50:51 +00:00
..
test_basic.py
server : add flag to disable the web-ui (
#10762
) (
#10751
)
2024-12-10 18:22:34 +01:00
test_chat_completion.py
Add --jinja and --chat-template-file flags
2024-12-30 03:50:51 +00:00
test_completion.py
ggml : more perfo with llamafile tinyblas on x86_64 (
#10714
)
2024-12-24 18:54:49 +01:00
test_ctx_shift.py
…
test_embedding.py
server : add support for "encoding_format": "base64" to the */embeddings endpoints (
#10967
)
2024-12-24 21:33:04 +01:00
test_infill.py
server : fix format_infill (
#10724
)
2024-12-08 23:04:29 +01:00
test_lora.py
…
test_rerank.py
server : fill usage info in embeddings and rerank responses (
#10852
)
2024-12-17 18:00:24 +02:00
test_security.py
…
test_slot_save.py
…
test_speculative.py
server : fix speculative decoding with context shift (
#10641
)
2024-12-04 22:38:20 +02:00
test_tokenize.py
…