Clauszy
06a92a193a
server : fix cache reuse logic ( #12161 )
...
The first kv shift offsets the positions of all tokens after head_c.
When using llama_kv_cache_seq_rm next, using head_c will remove the valid tokens because their positions have already been offset.
2025-03-05 09:25:45 +02:00
..
2025-01-12 11:32:42 +02:00
2025-01-12 11:32:42 +02:00
2025-02-04 13:15:24 +02:00
2025-01-12 11:32:42 +02:00
2025-02-15 16:40:57 +02:00
2025-03-04 18:53:26 +02:00
2025-01-12 11:32:42 +02:00
2025-01-21 14:07:12 +01:00
2025-01-30 19:13:58 +00:00
2025-01-15 18:28:35 +02:00
2025-01-12 11:32:42 +02:00
2025-02-15 21:03:30 +02:00
2025-01-12 11:32:42 +02:00
2025-02-14 02:13:43 +01:00
2025-02-15 16:40:57 +02:00
2025-03-05 06:30:31 +01:00
2025-02-28 11:31:47 +00:00
2025-03-04 18:53:26 +02:00
2025-02-15 16:40:57 +02:00
2025-03-04 12:19:39 -04:00
2025-03-04 18:53:26 +02:00
2025-03-04 18:53:26 +02:00
2025-02-12 21:36:11 +01:00
2025-03-04 18:53:26 +02:00
2025-01-12 11:32:42 +02:00
2025-02-15 16:40:57 +02:00
2025-03-03 12:44:56 +00:00
2025-01-12 11:32:42 +02:00
2025-03-05 09:25:45 +02:00
2025-01-12 11:32:42 +02:00
2025-01-21 13:18:51 +00:00
2025-02-15 16:40:57 +02:00
2025-02-15 16:40:57 +02:00
2025-01-12 11:32:42 +02:00
2025-02-24 22:33:23 +08:00
2025-01-12 11:32:42 +02:00
2025-03-04 18:53:26 +02:00
2025-02-15 16:40:57 +02:00
2025-02-15 16:40:57 +02:00