This commit updates the error message that is printed when the KV cache is not big enough to hold all the prompt and generated tokens. Specifically, it removes the reference to n_parallel and replaces it with n_len.

Signed-off-by: Daniel Bevenius <daniel.bevenius@gmail.com>
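
For context, the check in question guards against requesting more tokens than the KV cache can hold. Below is a minimal, illustrative sketch of such a guard with the message referring to n_len; the variable names (n_kv_req, n_ctx, n_len) follow the style of llama.cpp's examples, but this is an assumption-based sketch, not a copy of the patched file.

```cpp
// Illustrative sketch only: names, values, and message wording are assumptions,
// not the exact code changed by this commit.
#include <cstdio>

// Returns false (and prints the updated message) when the KV cache cannot
// hold all the prompt and generated tokens.
static bool check_kv_cache_size(int n_kv_req, int n_ctx) {
    if (n_kv_req > n_ctx) {
        fprintf(stderr, "error: n_kv_req > n_ctx, the required KV cache size is not big enough\n");
        // the message now refers to n_len instead of n_parallel
        fprintf(stderr, "       either reduce n_len or increase n_ctx\n");
        return false;
    }
    return true;
}

int main() {
    const int n_ctx    = 512;  // assumed context (KV cache) size in tokens
    const int n_prompt = 100;  // assumed number of prompt tokens
    const int n_len    = 600;  // assumed total tokens (prompt + generation)

    // required KV cache slots for the whole run
    const int n_kv_req = n_prompt + (n_len - n_prompt);

    // here n_kv_req (600) exceeds n_ctx (512), so the error message is printed
    if (!check_kv_cache_size(n_kv_req, n_ctx)) {
        return 1;
    }
    return 0;
}
```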