Georgi Gerganov
a10b36c91a
llama : refactor kv cache guard (#12695)
* llama : refactor kv cache guard
ggml-ci
* cont : fix comment [no ci]
* llama : fix kv_cache restore logic
ggml-ci
* context : simplify kv cache updates
ggml-ci
* cont : better name [no ci]
* llama : fix llama_decode return code when could not find KV slot
ggml-ci
* context : change log err -> warn [no ci]
* kv-cache : add comment + warning
2025-04-02 14:32:59 +03:00
..
2025-04-01 23:44:05 +02:00
2025-04-01 23:44:05 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-04-01 23:44:05 +02:00
2025-04-01 23:44:05 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-04-01 23:44:05 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-17 21:14:32 +01:00
2025-04-02 14:32:59 +03:00
2025-04-01 23:44:05 +02:00
2025-03-13 12:35:44 +02:00
2025-03-04 18:53:26 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-28 09:44:13 +02:00
2025-03-25 18:46:11 +01:00
2025-03-13 12:35:44 +02:00
2025-04-02 09:58:34 +02:00
2025-03-13 12:35:44 +02:00
2025-04-01 23:44:05 +02:00
2025-04-01 23:44:05 +02:00
2025-02-24 22:33:23 +08:00
2025-04-01 23:44:05 +02:00
2025-03-05 13:05:13 +00:00