llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-06-30 04:45:17 +00:00

Files

Xuan Son Nguyen 6c5bc0625f server : (refactoring) do not rely on JSON internally (#10643 )

* server : (refactoring) reduce usage of json internally

* move all response types to struct

* wip [no ci]

* many fixes

* add virtual function

* fix index

* minor style fix

* add std::move

* refactor handle_completions_generic

* add virtual functions

* remove server.hpp

* clarify server_sent_event RFC specs

* apply review comments

* fix model_alias and completion_probabilities

* small clean up

* remove virtual for to_json_oai_compat()

* naming oai_compat --> oaicompat

* fix unwanted recursive call

* update docs

2024-12-06 11:14:32 +01:00

test_basic.py

server : add more test cases (#10569 )

2024-11-29 21:48:56 +01:00

test_chat_completion.py

server : (refactoring) do not rely on JSON internally (#10643 )

2024-12-06 11:14:32 +01:00

test_completion.py

server : (refactoring) do not rely on JSON internally (#10643 )

2024-12-06 11:14:32 +01:00

test_ctx_shift.py

server : replace behave with pytest (#10416 )