mirror of
https://github.com/ggml-org/llama.cpp.git
synced 2025-07-31 14:44:34 -04:00
Add more tokenizer tests (#3742)
* Add more tokenizer tests
* Add starcoder
* Update test vocab files
* Restrict bpe tokenizer tests to unicode planes
* Update comment
* Comment cosmetics
* Remove bloom vocab/test
BIN
models/ggml-vocab-baichuan.gguf
Normal file
Binary file not shown.