model : more uniform output id handling (#14275)
* model : more uniform output id handling

ggml-ci

* cont : revert n_outputs < n_tokens optimization

ggml-ci

* cont : fix out_ids initialization

ggml-ci
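The change concerns how the batch's output ids (out_ids) and the output count (n_outputs) are tracked. The following is a minimal, hypothetical sketch, not the actual llama.cpp code, of what building such a mapping can look like: only the positions whose logits/embeddings are requested get an output row, so n_outputs can be smaller than n_tokens. The names `build_out_ids` and `token_needs_output` are illustrative assumptions.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical sketch: collect the batch positions that produce an output row.
// In this model, out_ids maps output row -> token position in the batch, and
// n_outputs == out_ids.size(), which may be less than n_tokens when only some
// tokens (e.g. the last one) request logits.
static std::vector<int32_t> build_out_ids(const std::vector<int8_t> & token_needs_output) {
    std::vector<int32_t> out_ids;
    out_ids.reserve(token_needs_output.size());

    for (int32_t i = 0; i < (int32_t) token_needs_output.size(); ++i) {
        if (token_needs_output[i]) {
            out_ids.push_back(i); // this position gets an output row
        }
    }

    return out_ids;
}
```

Keeping out_ids explicitly initialized and applying the same code path whether or not n_outputs < n_tokens is one way to make the handling "more uniform", at the cost of skipping the special-cased optimization the commit reverts.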