llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-14 12:19:48 -04:00

Files

Reese Levine 587d0118f5 ggml: WebGPU backend host improvements and style fixing (#14978 )

* Add parameter buffer pool, batching of submissions, refactor command building/submission

* Add header for linux builds

* Free staged parameter buffers at once

* Format with clang-format

* Fix thread-safe implementation

* Use device implicit synchronization

* Update workflow to use custom release

* Remove testing branch workflow

2025-08-04 08:52:43 -07:00

cmake

cmake : Fix BLAS link interface (ggml/1316)

2025-07-30 17:33:11 +03:00

include

ggml: Add initial WebGPU backend (#14521 )

2025-07-16 18:18:51 +03:00

src

ggml: WebGPU backend host improvements and style fixing (#14978 )

2025-08-04 08:52:43 -07:00

.gitignore

…

CMakeLists.txt

HIP: add GGML_HIP_MMQ_MFMA option to allow disableing the MFMA path. (#14930 )

2025-07-29 17:44:30 +02:00