metal : fix memory leak (#2762)

* metal : fix memory leak * metal : fix encoders memory leak * metal : clean up more memory resources * metal : fix more leaks * metal : reuse dispatch queue + autoreleasepool * metal : reuse array for command buffers and encoders * ggml : assert for odd number of blocks on ARM 15M tinyllama is an example
2025-09-02 05:14:11 -04:00 · 2023-08-28 10:59:08 +03:00
parent ebcee207b6
commit f55538c3cc
3 changed files with 88 additions and 24 deletions
--- a/ggml-metal.h
+++ b/ggml-metal.h
@@ -24,6 +24,7 @@

 // max memory buffers that can be mapped to the device
 #define GGML_METAL_MAX_BUFFERS 16
+#define GGML_METAL_MAX_COMMAND_BUFFERS 32

 struct ggml_tensor;
 struct ggml_cgraph;