Mirror of https://github.com/ggml-org/llama.cpp.git (synced 2025-09-05 06:45:21 -04:00)
* CUDA: add bf16 and f32 support to cublas_mul_mat_batched
* Review: add type traits and make function more generic
* Review: make check more explicit, add back comments, and fix formatting
* Review: fix formatting, remove useless type conversion, fix naming for bools