llama.cpp/conv2d-transpose.cuh at 0aedae00e6fb48680324a5ac5da9cba0e35de6b5

tqcq/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-17 13:40:55 -04:00

Aman Gupta c959f462a0 CUDA: add conv_2d_transpose (#14287)

* CUDA: add conv_2d_transpose

* remove direct include of cuda_fp16

* Review: add brackets for readability, remove ggml_set_param and add asserts

2025-06-20 22:48:24 +08:00

5 lines

157 B

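#include "common.cuh"

// Threads per CUDA block used when launching the conv_2d_transpose kernel.
#define CUDA_CONV2D_TRANSPOSE_BLOCK_SIZE 256
// Computes the transposed 2D convolution with zero padding (the _p0 suffix) into dst.
void ggml_cuda_conv_2d_transpose_p0(ggml_backend_cuda_context & ctx, ggml_tensor * dst);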
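
The header only declares the entry point, so the operation itself is not visible here. As a rough illustration of what a zero-padding transposed 2D convolution computes, and of how a 256-thread block size typically translates into a block count, here is a minimal single-channel CPU sketch in C++. It is not the CUDA kernel from this commit: the names `conv_transpose_out_size`, `num_blocks`, and `conv_transpose_2d_ref` are hypothetical, and the row-major layout and one-thread-per-output-element launch scheme are assumptions made for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of llama.cpp): output extent along one axis
// of a transposed convolution with zero padding.
inline int conv_transpose_out_size(int in_size, int kernel_size, int stride) {
    return (in_size - 1) * stride + kernel_size;
}

// Hypothetical helper: ceiling division, i.e. how many blocks of `block_size`
// threads are needed to cover `total` elements (assumes one thread per element).
inline int num_blocks(int total, int block_size) {
    return (total + block_size - 1) / block_size;
}

// Single-channel CPU reference with row-major buffers (an assumption).
// Each input pixel scatters a scaled copy of the kernel into the output,
// offset by the pixel position times the stride.
std::vector<float> conv_transpose_2d_ref(const std::vector<float> & input, int in_w, int in_h,
                                         const std::vector<float> & kernel, int k_w, int k_h,
                                         int stride) {
    assert(input.size()  == static_cast<std::size_t>(in_w) * in_h);
    assert(kernel.size() == static_cast<std::size_t>(k_w) * k_h);

    const int out_w = conv_transpose_out_size(in_w, k_w, stride);
    const int out_h = conv_transpose_out_size(in_h, k_h, stride);
    std::vector<float> out(static_cast<std::size_t>(out_w) * out_h, 0.0f);

    for (int iy = 0; iy < in_h; ++iy) {
        for (int ix = 0; ix < in_w; ++ix) {
            const float v = input[iy * in_w + ix];
            for (int ky = 0; ky < k_h; ++ky) {
                for (int kx = 0; kx < k_w; ++kx) {
                    const int oy = iy * stride + ky;
                    const int ox = ix * stride + kx;
                    out[oy * out_w + ox] += v * kernel[ky * k_w + kx];
                }
            }
        }
    }
    return out;
}
```

With a 2x2 input, a 2x2 kernel, and stride 2, the output is 4x4, so `num_blocks(16, 256)` covers it with a single block. A GPU version would more likely have each thread gather the contributions for one output element instead of scattering, which avoids write conflicts between threads.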