llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-06-27 20:05:20 +00:00

Files

Kerfuffle 91f6499393 Respect tokenizer.ggml.add_bos_token value when tokenizing (#4040)

* gguf-py: gguf-dump: Respect --no-tensor flag in JSON mode.

* Respect add_bos_token GGUF metadata value

* gguf-py: Try to fix SpecialVocab giving up too easily for the Nth time

2023-11-16 19:14:37 -07:00

__init__.py

gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)

2023-11-11 08:04:50 +03:00

gguf-convert-endian.py

Fix gguf-convert-endian script (#4037)

2023-11-11 08:35:31 -07:00

gguf-dump.py

Respect tokenizer.ggml.add_bos_token value when tokenizing (#4040)

2023-11-16 19:14:37 -07:00

gguf-set-metadata.py

gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)

2023-11-11 08:04:50 +03:00