model : add hunyuan moe (#14425)

* model : add hunyuan moe
* tokenizer ok
* fix tensor name
* cgraph init
* chat template
* wip
* almost working
* skip embed, fix bos
* cleanup
* yarn scaling
* cleanup
* correct rope type
* failed token fix
* ntk alpha freq_base
* tokenization working
* cleanup and pr changes
* vocab_size sanity check
* ntk alpha generic
* Update convert_hf_to_gguf.py
* Apply suggestions from code review
* fix regression
* fix style

---------

Co-authored-by: kooshi <1934337+kooshi@users.noreply.github.com>
2025-08-17 13:40:55 -04:00 · 2025-07-08 10:24:06 +02:00
parent 53903ae6fa
commit 8f22dc0a53
12 changed files with 449 additions and 0 deletions
--- a/src/llama-arch.h
+++ b/src/llama-arch.h
@@ -82,6 +82,7 @@ enum llm_arch {
     LLM_ARCH_DOTS1,
     LLM_ARCH_ARCEE,
     LLM_ARCH_ERNIE4_5,
+    LLM_ARCH_HUNYUAN_MOE,
     LLM_ARCH_UNKNOWN,
 };