OpenCL Token Generation Acceleration (#1459)

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-17 13:40:55 -04:00

* Move back to C++ for OpenCL

* Refactor OpenCL code to work more like the CUDA code, add missing functions

* Deduplicate dequant kernels

* Add OpenCL compile options

* Use compile args for preprocessing constants

* Restore default platform + device selection by id behavior

---------

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
Co-authored-by: Henri Vasserman <henv@hot.ee>

This commit is contained in:

0cc4m

2023-05-22 23:33:24 +02:00

committed by

GitHub

parent 7e4ea5beff

commit 2e6cd4b025

8 changed files with 1113 additions and 536 deletions

1034

ggml-opencl.cpp Normal file

View File

File diff suppressed because it is too large Load Diff

OpenCL Token Generation Acceleration (#1459)

1034 ggml-opencl.cpp Normal file View File

1034

ggml-opencl.cpp Normal file

View File