llama.cpp/examples at 12d0188c0dc6146ffde6d277a93f232ccbe699f8 - llama.cpp - Cat's Mantra

tqcq/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-06 17:13:34 -04:00

Files

History

Georgi Gerganov 12d0188c0d kv-cache : refactor + add llama_memory_state_i (#13746 )

* kv-cache : simplify the "struct llama_kv_cache" interface

ggml-ci

* kv-cache : revert the (n_swa + n_ubatch) change (for next PR)

ggml-ci

* kv-cache : some comments

ggml-ci

* context : fix graph reserve for multiple sequences

ggml-ci

* kv-cache : fix typo [no ci]

* kv-cache : fix find_slot() logic for free slots

ggml-ci

* llama : add TODO for deprecating the defrag API in the future

* kv-cache : improve find_slot() using min/max seq pos info

ggml-ci

* llama : handle aborts and compute errors

ggml-ci

* memory : extract state into llama_memory_state

ggml-ci

* kv-cache : add comments

ggml-ci

* server : update batching logic to reset n_batch on successful decode

* server : upon full re-processing, remove the sequence from the cache

* kv-cache : add TODO for doing split_equal when split_simple fails

ggml-ci

2025-05-31 10:24:04 +03:00

..

common : refactor downloading system, handle mmproj with -hf option (#12694 )

2025-04-01 23:44:05 +02:00

llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181 )

2025-03-13 12:35:44 +02:00

convert-llama2c-to-ggml

llama : add llama_vocab, functions -> methods, naming (#11110 )

2025-01-12 11:32:42 +02:00

deprecation-warning

…

examples : allow extracting embeddings from decoder contexts (#13797 )

2025-05-26 14:03:54 +03:00

llama : add llama_vocab, functions -> methods, naming (#11110 )

2025-01-12 11:32:42 +02:00

…

…

…

common : refactor downloading system, handle mmproj with -hf option (#12694 )

2025-04-01 23:44:05 +02:00

…

cmake : enable curl by default (#12761 )

2025-04-07 13:35:19 +02:00

llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181 )

2025-03-13 12:35:44 +02:00

llama : remove llama_kv_cache_view API + remove deprecated (#13653 )

2025-05-20 16:13:16 +03:00

llama : remove llama_kv_cache_view API + remove deprecated (#13653 )

2025-05-20 16:13:16 +03:00

kv-cache : refactor + add llama_memory_state_i (#13746 )

2025-05-31 10:24:04 +03:00

common : refactor downloading system, handle mmproj with -hf option (#12694 )

2025-04-01 23:44:05 +02:00

examples : allow extracting embeddings from decoder contexts (#13797 )

2025-05-26 14:03:54 +03:00

save-load-state

llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181 )

2025-03-13 12:35:44 +02:00

fix: check model pointer validity before use (#13631 )

2025-05-19 13:25:41 +03:00

kv-cache : simplify the interface (#13660 )

2025-05-21 15:11:13 +03:00

simple-cmake-pkg

repo : update links to new url (#11886 )

2025-02-15 16:40:57 +02:00

common : refactor downloading system, handle mmproj with -hf option (#12694 )

2025-04-01 23:44:05 +02:00

speculative-simple

common : refactor downloading system, handle mmproj with -hf option (#12694 )

2025-04-01 23:44:05 +02:00

sycl : backend documentation review (#13544 )

2025-05-19 14:38:20 +01:00

examples/training: Fix file name in README (#13803 )

2025-05-26 16:55:24 +02:00

chat-13B.bat

…

chat-13B.sh

…

chat-persistent.sh

…

chat-vicuna.sh

…

chat.sh

…

CMakeLists.txt

llama/ggml: add LLM training support (#10544 )

2025-05-12 14:44:49 +02:00

convert_legacy_llama.py

…

json_schema_pydantic_example.py

…

json_schema_to_grammar.py

grammar : handle maxItems == 0 in JSON schema (#13117 )

2025-04-26 10:10:20 +02:00

llama.vim

repo : update links to new url (#11886 )

2025-02-15 16:40:57 +02:00

llm.vim

…

Miku.sh

…

pydantic_models_to_grammar_examples.py

llama : move end-user examples to tools directory (#13249 )

2025-05-02 20:27:13 +02:00

pydantic_models_to_grammar.py

…

reason-act.sh

…

regex_to_grammar.py

…

server_embd.py

llama : fix FA when KV cache is not used (i.e. embeddings) (#12825 )

2025-04-08 19:54:51 +03:00

server-llama2-13B.sh

…

ts-type-to-grammar.sh

…