llama : one-off chat template fix for Mistral-Small-2503 (#13398)

* llama : one-off chat template fix for Mistral-Small-2503
* update readme
* add mistral-v7-tekken
2025-08-18 05:56:00 -04:00 · 2025-05-09 11:17:51 +02:00
parent b486ba05bf
commit 3f96aeff39
4 changed files with 18 additions and 7 deletions
--- a/tools/mtmd/README.md
+++ b/tools/mtmd/README.md
@@ -46,7 +46,7 @@ llama-mtmd-cli -hf ggml-org/Qwen2.5-VL-32B-Instruct-GGUF
 llama-mtmd-cli -hf ggml-org/Qwen2.5-VL-72B-Instruct-GGUF

 # Mistral Small 3.1 24B (IQ2_M quantization)
-llama-mtmd-cli -hf ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF --chat-template mistral-v7
+llama-mtmd-cli -hf ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF
 ```

 ## How it works and what is `mmproj`?