llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-17 13:40:55 -04:00

Files

Eve 7b1ec53f56 vulkan: bugfixes for small subgroup size systems + llvmpipe test (#10809 )

* ensure mul mat shaders work on systems with subgroup size less than 32

more fixes

add test

* only s_warptile_mmq needs to be run with 32 threads or more

2024-12-17 06:52:55 +01:00

include

llama : add Qwen2VL support + multimodal RoPE (#10361 )

2024-12-14 14:43:46 +02:00

src

vulkan: bugfixes for small subgroup size systems + llvmpipe test (#10809 )

2024-12-17 06:52:55 +01:00

.gitignore

vulkan : cmake integration (#8119 )

2024-07-13 18:12:39 +02:00

CMakeLists.txt

Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (#10693 )

2024-12-13 12:23:52 -08:00