mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-01 13:05:52 +00:00

Files

Xuan-Son Nguyen 7841fc723e llama : Add Gemma 3 support (+ experimental vision capability) (#12343 )

* llama : Add Gemma 3 text-only support

* fix python coding style

* fix compile on ubuntu

* python: fix style

* fix ubuntu compile

* fix build on ubuntu (again)

* fix ubuntu build, finally

* clip : Experimental support for Gemma 3 vision (#12344)

* clip : Experimental support for Gemma 3 vision

* fix build

* PRId64

2025-03-12 09:30:24 +01:00

592 B

Raw Permalink Blame History

Gemma 3 vision

Important

This is very experimental, only used for demo purpose.

How to get mmproj.gguf?

cd gemma-3-4b-it
python ../llama.cpp/examples/llava/gemma3_convert_encoder_to_gguf.py .

# output file is mmproj.gguf

How to run it?

What you need:

The text model GGUF, can be converted using convert_hf_to_gguf.py
The mmproj file from step above
An image file

# build
cmake -B build
cmake --build build --target llama-gemma3-cli

# run it
./build/bin/llama-gemma3-cli -m {text_model}.gguf --mmproj mmproj.gguf --image your_image.jpg

592 B Raw Permalink Blame History

Gemma 3 vision

How to get mmproj.gguf?

How to run it?

592 B

Raw Permalink Blame History