llama.cpp/examples
Pierrick Hymbert 2f0ee84b9b server: bench: minor fixes (#10765)
* server/bench:
- support the OpenAI streaming standard: output terminated with [DONE]\n\n
- export raw k6 results in CSV
- fix too many idle TCP connections in tcp_wait
- add a metric for the time to emit the first token
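The [DONE]\n\n terminator mentioned above follows the OpenAI streaming convention: each chunk arrives as a Server-Sent-Events `data:` line carrying a JSON payload, and the stream ends with a literal `data: [DONE]` line. A minimal sketch of a client-side parser for that convention (the helper name and sample payloads are illustrative, not from the bench script):

```python
import json

def read_sse_events(lines):
    """Collect JSON payloads from OpenAI-style SSE lines, stopping at [DONE]."""
    events = []
    for line in lines:
        line = line.strip()
        if not line.startswith("data: "):
            continue  # skip blank keep-alive lines and SSE comments
        payload = line[len("data: "):]
        if payload == "[DONE]":
            break  # end-of-stream sentinel per the OpenAI streaming format
        events.append(json.loads(payload))
    return events

# Illustrative stream of two content deltas followed by the terminator.
stream = [
    'data: {"choices": [{"delta": {"content": "Hel"}}]}',
    '',
    'data: {"choices": [{"delta": {"content": "lo"}}]}',
    '',
    'data: [DONE]',
    '',
]
chunks = read_sse_events(stream)
text = "".join(c["choices"][0]["delta"]["content"] for c in chunks)
```

A benchmark that measures time-to-first-token can timestamp the first iteration of a loop like this, since the first `data:` payload marks the first emitted token.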

* server/bench:
- fix behavior when Prometheus is not started
- wait for the server to be ready before starting the bench
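Waiting for readiness typically means polling a health endpoint until it answers before launching the load generator. A hedged sketch of that pattern (the `/health` URL, timeout, and polling interval are assumptions for illustration, not the bench script's actual values):

```python
import time
import urllib.request
import urllib.error

def wait_for_server(url, timeout=60.0, interval=0.5):
    """Poll a health endpoint until it returns HTTP 200 or the timeout elapses.

    Returns True once the server answers, False if the deadline passes.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(url, timeout=2) as resp:
                if resp.status == 200:
                    return True
        except (urllib.error.URLError, OSError):
            pass  # server not up yet; keep polling
        time.sleep(interval)
    return False

# Example: an unreachable port gives up after the timeout and returns False.
ready = wait_for_server("http://127.0.0.1:1/health", timeout=1.0, interval=0.2)
```

Gating the benchmark on this check avoids counting server startup time (model load, warm-up) against the measured requests.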
2025-01-02 18:06:12 +01:00