llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-06-28 12:25:03 +00:00

Files

Xuan-Son Nguyen 92ecdcc06a mtmd : add vision support for llama 4 (#13282)

* wip llama 4 conversion

* rm redundant __init__

* fix conversion

* fix conversion

* test impl

* try this

* reshape patch_embeddings_0

* fix view

* rm ffn_post_norm

* cgraph ok

* f32 for pos embd

* add image marker tokens

* Llama4UnfoldConvolution

* correct pixel shuffle

* fix merge conflicts

* correct

* add debug_graph

* logits matched, but it still perceives the image incorrectly

* fix style

* add image_grid_pinpoints

* handle llama 4 preprocessing

* rm load_image_size

* rm unused line

* fix

* small fix 2

* add test & docs

* fix llava-1.6 test

* test: add notion of huge models

* add comment

* add warn about degraded quality

2025-05-19 13:04:14 +02:00

backend

sycl: use oneDNN for matrices multiplication (#12972)

2025-05-15 16:53:41 +02:00

development

llama : move end-user examples to tools directory (#13249)

2025-05-02 20:27:13 +02:00

multimodal

mtmd : rename llava directory to mtmd (#13311)

2025-05-05 16:02:55 +02:00

android.md

repo : update links to new url (#11886)