llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-14 04:17:53 -04:00

Files

Reese Levine 9515c6131a ggml: WebGPU disable SET_ROWS for now (#15078 )

* Add paramater buffer pool, batching of submissions, refactor command building/submission

* Add header for linux builds

* Free staged parameter buffers at once

* Format with clang-format

* Fix thread-safe implementation

* Use device implicit synchronization

* Update workflow to use custom release

* Remove testing branch workflow

* Disable set_rows until it's implemented

* Fix potential issue around empty queue submission

* Try synchronous submission

* Try waiting on all futures explicitly

* Add debug

* Add more debug messages

* Work on getting ssh access for debugging

* Debug on failure

* Disable other tests

* Remove extra if

* Try more locking

* maybe passes?

* test

* Some cleanups

* Restore build file

* Remove extra testing branch ci

2025-08-05 16:26:38 -07:00

actions

releases : use arm version of curl for arm releases (#13592 )

2025-05-16 19:36:51 +02:00

ISSUE_TEMPLATE

ggml : remove kompute backend (#14501 )

2025-07-03 07:48:32 +03:00

workflows

ggml: WebGPU disable SET_ROWS for now (#15078 )

2025-08-05 16:26:38 -07:00

labeler.yml

ggml : remove kompute backend (#14501 )

2025-07-03 07:48:32 +03:00

pull_request_template.md

repo : update links to new url (#11886 )

2025-02-15 16:40:57 +02:00