llama.cpp

Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2025-07-26 11:13:53 -04:00

Files

lhez 300907b211 opencl: Fix rope and softmax (#11833)

* opencl: fix `ROPE`

* opencl: fix `SOFT_MAX`

* Add fp16 variant

* opencl: enforce subgroup size for `soft_max`

2025-02-14 12:12:23 -07:00

…

2025-02-12 10:06:53 -04:00

2025-02-14 12:12:23 -07:00

.gitignore

…

CMakeLists.txt

2025-02-04 12:59:15 +02:00