mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-06-28 20:25:20 +00:00

Files

Jared Van Bortel 1b67731e18 BERT tokenizer fixes (#6498)

Key changes:
* BERT conversion: fix abuse of LlamaHfVocab, do not set BOS or EOS
* Nomic Embed conversion: pad vocab instead of slicing embedding tensor
* llama_tokenize: handle added special tokens like HF does

2024-04-09 13:44:08 -04:00

CMakeLists.txt

lookahead : add example for lookahead decoding (#4207)

2023-11-26 20:33:07 +02:00

lookahead.cpp

BERT tokenizer fixes (#6498)

2024-04-09 13:44:08 -04:00

README.md

english : use typos to fix comments and logs (#4354)

2023-12-12 11:53:36 +02:00

README.md

llama.cpp/examples/lookahead

Demonstration of the lookahead decoding technique:

https://lmsys.org/blog/2023-11-21-lookahead-decoding/

More info: https://github.com/ggerganov/llama.cpp/pull/4207