tqcq/llama.cpp
Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2025-07-28 13:20:27 -04:00.
Path: llama.cpp/ggml at commit b044a0fe3ca0cbef9dd041edce3ebda8c501fae4

Latest commit b044a0fe3c by Wagner Bruna, 2025-02-10 07:08:22 +01:00:
vulkan: add environment variable GGML_VK_PREFER_HOST_MEMORY to avoid VRAM allocation (#11592)
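In practice an environment variable like this would normally be set in the shell before launching a llama.cpp binary. Below is a minimal C++ sketch of programmatic use instead, assuming the Vulkan backend reads the variable during initialization via the usual ggml entry point ggml_backend_vk_init, and assuming that a value of "1" enables the host-memory preference; neither the timing nor the accepted values are confirmed by the commit title alone.

```cpp
// Hypothetical usage sketch for GGML_VK_PREFER_HOST_MEMORY (#11592).
// Assumptions: the Vulkan backend checks this variable when it is
// initialized, and "1" enables the host-memory preference; both are
// inferences from the commit title, not verified against the source.
#include <cstdio>
#include <cstdlib>          // setenv (POSIX)
#include "ggml-backend.h"
#include "ggml-vulkan.h"

int main() {
    // Must be set before the backend inspects the environment.
    setenv("GGML_VK_PREFER_HOST_MEMORY", "1", /*overwrite=*/1);

    ggml_backend_t backend = ggml_backend_vk_init(0);  // Vulkan device 0
    if (backend == nullptr) {
        std::fprintf(stderr, "failed to initialize Vulkan backend\n");
        return 1;
    }

    // ... allocate tensors and run graphs; with the variable set, the
    // backend should prefer host-visible memory over dedicated VRAM ...

    ggml_backend_free(backend);
    return 0;
}
```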
Contents:

Name             Last commit                                                     Last updated
cmake/           cmake: add ggml find package (#11369)                           2025-01-26 12:07:48 -04:00
include/         CUDA: use mma PTX instructions for FlashAttention (#11583)      2025-02-02 19:31:09 +01:00
src/             vulkan: add environment variable GGML_VK_PREFER_HOST_MEMORY to avoid VRAM allocation (#11592)   2025-02-10 07:08:22 +01:00
.gitignore       vulkan : cmake integration (#8119)                              2024-07-13 18:12:39 +02:00
CMakeLists.txt   cmake: Add ability to pass in GGML_BUILD_NUMBER (ggml/1096)     2025-02-04 12:59:15 +02:00
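On the two CMake-related entries above: #11369 suggests that an installed ggml can be located from a consuming project via CMake's find_package(ggml) (the exported target names are not shown on this page), and ggml/1096 suggests the build number can be injected at configure time, presumably as a cache variable along the lines of -DGGML_BUILD_NUMBER=<n>. Both readings are inferred from the commit titles alone.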