Mistral Small 2506 models using the Pixtral vision encoder were running out of GPU memory when processing images larger than 1024x1024 pixels: with no cap on the input size, the number of image patches grows with the image area, and the encoder's attention memory grows with the square of the patch count, so large inputs quickly exhaust VRAM. This fix applies the same 1024x1024 limit already used by Qwen2VL models, preventing OOM while remaining compatible with existing models.
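A minimal sketch of the kind of clamp involved, assuming a simple longest-edge resize applied during image preprocessing; the struct and function names here are illustrative, not llama.cpp's actual API:

```cpp
#include <algorithm>
#include <cstdio>

// Hypothetical example: shrink an image so neither side exceeds a maximum
// edge length, preserving aspect ratio, before the vision encoder slices it
// into patches. This mirrors the idea of the fix, not its exact code.
struct image_size {
    int width;
    int height;
};

// Scale (width, height) down so the longest side is at most max_edge.
// Returns the input unchanged when it is already within bounds.
static image_size clamp_image_size(image_size in, int max_edge = 1024) {
    const int longest = std::max(in.width, in.height);
    if (longest <= max_edge) {
        return in; // already small enough, no resize needed
    }
    const double scale = static_cast<double>(max_edge) / longest;
    image_size out;
    out.width  = std::max(1, static_cast<int>(in.width  * scale));
    out.height = std::max(1, static_cast<int>(in.height * scale));
    return out;
}

int main() {
    const image_size big     = {4096, 3072};
    const image_size clamped = clamp_image_size(big);
    // 4096x3072 -> 1024x768: the patch count (and with it the encoder's
    // attention memory) drops by roughly 16x for this input.
    std::printf("%dx%d -> %dx%d\n", big.width, big.height, clamped.width, clamped.height);
    return 0;
}
```

Because patch count scales with area, capping the longest edge at 1024 bounds the encoder's memory use regardless of the original image size, while images already within the limit pass through untouched.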