llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-28 11:08:19 -04:00

Files

Paul Tsochantaris 96b6912103 metal : single allocation of encode_async block (#9747)

* Single allocation of encode_async block with non-ARC capture in ggml-metal.m

* Moving Block_release to the deallocation code

* Release encode block when re-setting encoding buffer count if needed

* Update ggml/src/ggml-metal.m

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2024-10-07 15:26:31 +03:00

cmake

llama : reorganize source code + improve CMake (#8006)

2024-06-26 18:33:02 +03:00

include

ggml : fix typo in example usage ggml_gallocr_new (ggml/984)

2024-10-04 18:50:05 +03:00

src

metal : single allocation of encode_async block (#9747)

2024-10-07 15:26:31 +03:00

.gitignore

vulkan : cmake integration (#8119)

2024-07-13 18:12:39 +02:00

CMakeLists.txt

cmake : do not hide GGML options + rename option (#9465)

2024-09-16 10:27:50 +03:00