llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745)

* refactor llama_batch_get_one * adapt all examples * fix simple.cpp * fix llama_bench * fix * fix context shifting * free batch before return * use common_batch_add, reuse llama_batch in loop * null terminated seq_id list * fix save-load-state example * fix perplexity * correct token pos in llama_batch_allocr
2025-06-27 03:55:20 +00:00 · 2024-10-18 23:18:01 +02:00
parent afd9909a64
commit cda0e4b648
22 changed files with 205 additions and 118 deletions
--- a/examples/server/server.cpp
+++ b/examples/server/server.cpp
@ -2326,7 +2326,6 @@ struct server_context {
                batch.n_seq_id + i,
                batch.seq_id   + i,
                batch.logits   + i,
-                0, 0, 0, // unused
            };

            const int ret = llama_decode(ctx, batch_view);