llama.cpp/ggml-webgpu.h at e4868d16d24dec55e61bcaadaca28feed8f98b13

tqcq/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-27 11:43:35 -04:00

Reese Levine 21c021745d ggml: Add initial WebGPU backend (#14521)

* Minimal setup of webgpu backend with dawn. Just prints out the adapter and segfaults

* Initialize webgpu device

* Making progress on setting up the backend

* Finish more boilerplate/utility functions

* Organize file and work on alloc buffer

* Add webgpu_context to prepare for actually running some shaders

* Work on memset and add shader loading

* Work on memset polyfill

* Implement set_tensor as webgpu WriteBuffer, remove host_buffer stubs since webgpu doesn't support it

* Implement get_tensor and buffer_clear

* Finish rest of setup

* Start work on compute graph

* Basic mat mul working

* Work on emscripten build

* Basic WebGPU backend instructions

* Use EMSCRIPTEN flag

* Work on passing ci, implement 4d tensor multiplication

* Pass thread safety test

* Implement permuting for mul_mat and cpy

* minor cleanups

* Address feedback

* Remove division by type size in cpy op

* Fix formatting and add github action workflows for vulkan and metal (m-series) webgpu backends

* Fix name

* Fix macos dawn prefix path

2025-07-16 18:18:51 +03:00

20 lines

328 B

C

#pragma once
#include "ggml.h"
#include "ggml-backend.h"
#ifdef  __cplusplus
extern "C" {
#endif
#define GGML_WEBGPU_NAME "WebGPU"
// Needed for examples in ggml
GGML_BACKEND_API ggml_backend_t ggml_backend_webgpu_init(void);
GGML_BACKEND_API ggml_backend_reg_t ggml_backend_webgpu_reg(void);
#ifdef  __cplusplus
}
#endif
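
The header above only declares the two entry points, so a caller might use `ggml_backend_webgpu_init()` roughly as sketched below. This is a minimal sketch under stated assumptions, not code from the commit: `HAVE_GGML_WEBGPU` and `probe_webgpu` are hypothetical names, the stub block exists only so the file compiles without ggml, and treating a NULL return as failure is an assumption about the backend's behavior. `ggml_backend_name` and `ggml_backend_free` come from `ggml-backend.h`.

```c
#include <stdio.h>
#include <stdlib.h>

#ifdef HAVE_GGML_WEBGPU /* hypothetical macro: define it when building against ggml */
#include "ggml-webgpu.h"
#else
/* Stand-in stubs so the sketch compiles on its own. They are NOT the real
 * implementation; only the names and signatures mirror the ggml headers. */
#define GGML_WEBGPU_NAME "WebGPU"
struct stub_backend { const char * name; };
typedef struct stub_backend * ggml_backend_t;

static ggml_backend_t ggml_backend_webgpu_init(void) {
    struct stub_backend * b = malloc(sizeof(*b));
    if (b != NULL) {
        b->name = GGML_WEBGPU_NAME;
    }
    return b;
}

static const char * ggml_backend_name(ggml_backend_t backend) {
    return backend->name;
}

static void ggml_backend_free(ggml_backend_t backend) {
    free(backend);
}
#endif

/* Initializes the WebGPU backend, prints its name, and releases it.
 * Returns 0 on success and 1 if initialization failed.
 * Assumption: a NULL return from ggml_backend_webgpu_init() signals failure. */
int probe_webgpu(void) {
    ggml_backend_t backend = ggml_backend_webgpu_init();
    if (backend == NULL) {
        fprintf(stderr, "failed to initialize the %s backend\n", GGML_WEBGPU_NAME);
        return 1;
    }
    printf("initialized backend: %s\n", ggml_backend_name(backend));
    ggml_backend_free(backend);
    return 0;
}
```

Calling `probe_webgpu()` from `main` and returning its result gives a small smoke test. When built against the real library with `HAVE_GGML_WEBGPU` defined, the stubs drop out and the same function exercises the actual backend.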