Add LLaDA 8b Diffusion model (#14771)

* Add support for Llada-8b: diffusion model * Add README * Fix README and convert_hf_to_gguf * convert_hf_to_gguf.py: address review comments * Make everything in a single example * Remove model-specific sampling * Remove unused argmax * Remove braced initializers, improve README.md a bit * Add diffusion specific gguf params in set_vocab, remove setting rope_theta and rms_norm_eps * Remove adding the mask token * Move add_add_bos_token to set_vocab * use add_bool in gguf_writer.py
2025-09-03 13:48:51 -04:00 · 2025-07-31 19:49:09 +08:00
parent 11490b3672
commit 8a4a856277
12 changed files with 931 additions and 385 deletions
--- a/src/llama-arch.h
+++ b/src/llama-arch.h
@@ -93,6 +93,7 @@ enum llm_arch {
    LLM_ARCH_LFM2,
    LLM_ARCH_DREAM,
    LLM_ARCH_SMALLTHINKER,
+    LLM_ARCH_LLADA,
    LLM_ARCH_UNKNOWN,
 };