There was a copy-paste style typo that caused the log message to print the same number twice when a request was received with `n_predict` greater than the server-side configuration.

Before the fix:

```
slot launch_slot_: id 0 | task 0 | n_predict = 4096 exceeds server configuration, setting to 4096
```

After the fix:

```
slot launch_slot_: id 0 | task 0 | n_predict = 8192 exceeds server configuration, setting to 4096
```
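For illustration, here is a minimal, self-contained sketch of the pattern behind the fix. The struct, variable names, and log call below are assumptions for demonstration, not the actual `launch_slot_` code in the server; only the idea carries over: log the value the client requested, then the limit it is clamped to, rather than printing the limit twice.

```cpp
// Hypothetical sketch of the n_predict clamping, not the real server code.
#include <cstdio>

struct slot_params {
    int n_predict = 8192;  // value requested by the client
};

int main() {
    slot_params params;
    const int n_predict_max = 4096;  // server-side configuration

    if (params.n_predict > n_predict_max) {
        // Bug: the first argument used to be n_predict_max, so the limit
        // was printed twice. Fix: pass the requested value first.
        std::printf("n_predict = %d exceeds server configuration, setting to %d\n",
                    params.n_predict, n_predict_max);
        params.n_predict = n_predict_max;
    }
    return 0;
}
```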