llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-05 08:28:37 -04:00

Files

Xuan-Son Nguyen 8f22dc0a53 model : add hunyuan moe (#14425)

* model : add hunyuan moe

* tokenizer ok

* fix tensor name

* cgraph init

* chat template

* wip

* almost working

* skip embed, fix bos

* cleanup

* yarn scaling

* cleanup

* correct rope type

* failed token fix

* ntk alpha freq_base

* tokenization working

* cleanup and pr changes

* vocab_size sanity check

* ntk alpha generic

* Update convert_hf_to_gguf.py

* Apply suggestions from code review

* fix regression

* fix style

---------

Co-authored-by: kooshi <1934337+kooshi@users.noreply.github.com>

2025-07-08 11:24:06 +03:00

llama-cpp.h

llama : add llama_vocab, functions -> methods, naming (#11110)

2025-01-12 11:32:42 +02:00

llama.h

model : add hunyuan moe (#14425)

2025-07-08 11:24:06 +03:00