Georgi Gerganov
de2ef53a4b
kv-cache : rework kv_cell (#13706)
* kv-cache : rework kv_cell
ggml-ci
* kv-cells : use "shift" instead of "delta" consistently
ggml-ci
* llama : add llama_max_parallel_sequences()
ggml-ci
* kv-cells : update comments [no ci]
* context : fail upon construction if sequences exceed max value
ggml-ci
* kv-cells : get_pos() -> pos_get() + comments
ggml-ci
* kv-cells : fix tracking of "used" cells
ggml-ci
2025-05-25 16:34:36 +03:00
..
2025-05-12 14:44:49 +02:00
2025-05-09 13:02:07 +02:00
2025-03-13 12:35:44 +02:00
2025-05-13 15:12:01 +02:00
2025-04-28 22:52:15 +03:00
2025-05-21 15:11:13 +03:00
2025-05-02 17:48:36 +03:00
2025-05-09 11:17:51 +02:00
2025-05-09 11:17:51 +02:00
2025-05-25 16:34:36 +03:00
2025-05-12 14:44:49 +02:00
2025-05-25 16:34:36 +03:00
2025-05-25 16:34:36 +03:00
2025-05-25 01:48:08 +01:00
2025-03-05 13:05:13 +00:00
2025-05-24 16:49:12 +02:00
2025-05-20 08:05:46 +03:00
2025-05-23 20:16:13 +03:00
2025-05-23 20:16:13 +03:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-05-25 16:34:36 +03:00
2025-05-25 16:34:36 +03:00
2025-05-25 16:34:36 +03:00
2025-03-13 12:35:44 +02:00
2025-05-25 16:34:36 +03:00
2025-03-24 12:17:10 +02:00
2025-05-15 19:13:11 +02:00
2025-04-02 14:52:01 +02:00
2025-05-12 14:44:49 +02:00
2025-05-12 14:44:49 +02:00
2025-05-25 10:29:43 +02:00
2025-05-20 08:05:46 +03:00
2025-05-13 19:12:31 +02:00
2025-05-06 22:36:24 +02:00
2025-05-24 12:29:09 +02:00
2025-05-12 14:44:49 +02:00
2025-05-16 16:38:07 +02:00
2025-02-15 16:40:57 +02:00