llama.cpp/examples at 48c857aa10aea73210a4a72da3f1a6f99269e75d - llama.cpp - Cat's Mantra

tqcq/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-12 03:21:10 -04:00

Files

History

Xuan Son Nguyen 48c857aa10 server : refactored the task processing logic (#5065 )

* server: add llama_server_queue struct

* server: add llama_server_response_event

* server: add comments

* server: move all mutexes away from server.cpp

* server: correct multitask response

* server: only add back deferred tasks when one slot is available

* server: fix a race condition cause by "request_completion"

2024-01-26 14:42:20 +02:00

..

…

…

…

…

…

…

convert-llama2c-to-ggml

…

…

…

…

…

…

…

parallel : add option to load external prompt file (#3416 )

2023-10-06 16:16:38 +03:00

…

…

llama.swiftui : update models layout (#4826 )

2024-01-12 14:48:00 +02:00

…

…

…

…

…

…

…

…

…

…

save-load-state

…

server : refactored the task processing logic (#5065 )

2024-01-26 14:42:20 +02:00

…

…

…

train-text-from-scratch

…

alpaca.sh

…

base-translate.sh

…

chat-13B.bat

…

chat-13B.sh

…

chat-persistent.sh

…

chat-vicuna.sh

…

chat.sh

…

CMakeLists.txt

…

gpt4all.sh

…

json-schema-to-grammar.py

…

llama2-13b.sh

…

llama2.sh

…

llama.vim

…

llm.vim

…

make-ggml.py

…

Miku.sh

…

pydantic_models_to_grammar.py

…

pydantic-models-to-grammar-examples.py

…

reason-act.sh

…

server-llama2-13B.sh

…