llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-22 10:48:12 +00:00

Files

Georgi Gerganov 7b53389c24 metal : add memory pool for temp allocs (#12850 )

* metal : add memory pool for temp allocs (wip) [no ci]

* cont : free buffers from the heap

* cont : resize heap [no ci]

* cont : refactor heap [no ci]

* cont : heap for each cmd buffer [no ci]

* cont : fix free

* wip

* cont : fix alignment [no ci]

* cont : not working .. [no ci]

* cont : heap allocation now works [no ci]

* cont : use MTLHeapTypePlacement

ggml-ci

* metal : use dynamic MTLHeap allocations

ggml-ci

* metal : add comments

* metal : disable softmax use of mem_pool

ggml-ci

* metal : final touches

2025-04-22 16:15:51 +03:00

cmake

…

include

…

src

metal : add memory pool for temp allocs (#12850 )

2025-04-22 16:15:51 +03:00

.gitignore

…

CMakeLists.txt

…