llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-26 19:23:37 -04:00

Files

Henry Linjamäki edbf42edfd opencl: fix couple crashes (#12795 )

* opencl: fix couple crashes

* fix kernel launches failed on devices which do not support
  non-uniform work-groups. When non-uniform work-groups are not
  supported, set `local_work_size` to NULL (= let driver choose the
  work-group sizes). This patch does not cover everything - just the
  cases tested by test-backend-ops.

* fix sub-buffer creation failed due to `cl_buffer_region::origin` not
  being aligned to `CL_DEVICE_MEM_BASE_ADDR_ALIGN`.

* OpenCL: query non-uniform WG sizes only on OpenCL 3.0+

2025-05-21 13:21:17 -07:00

cmake

scripts : update sync + fix cmake merge

2025-03-27 10:09:29 +02:00

include

ggml : add ggml_gelu_erf() (#13667 )

2025-05-21 16:26:33 +02:00

src

opencl: fix couple crashes (#12795 )

2025-05-21 13:21:17 -07:00

.gitignore

…

CMakeLists.txt

sycl: use oneDNN for matrices multiplication (#12972 )

2025-05-15 16:53:41 +02:00