llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-28 21:23:55 -04:00

Files

Jeff Bolz 1f7d50b293 vulkan: Track descriptor pools/sets per-context (#14109)

Use the same descriptor set layout for all pipelines (MAX_PARAMETER_COUNT == 8)
and move it to the vk_device. Move all descriptor pool and set tracking to the
context, since none of it is specific to pipelines anymore. The context holds a
single vector of pools, a single vector of sets, one counter to track requests,
and one counter to track use.

2025-06-11 07:19:25 +02:00
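
The bookkeeping described in the commit message above can be sketched roughly as follows. This is an illustration under stated assumptions, not the code from the commit: the handle types are plain integers so the sketch compiles without the Vulkan SDK, and every name except MAX_PARAMETER_COUNT (Device, Context, SETS_PER_POOL, request_descriptor_sets, and so on) is hypothetical, as is the pool size of 256.

```cpp
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical stand-ins for VkDescriptorPool / VkDescriptorSet handles.
using FakePool = std::uint32_t;
using FakeSet  = std::uint32_t;

constexpr std::size_t MAX_PARAMETER_COUNT = 8;   // value taken from the commit message
constexpr std::size_t SETS_PER_POOL       = 256; // assumption, chosen for illustration

// The device owns the one layout shared by every pipeline.
struct Device {
    std::size_t shared_layout_bindings = MAX_PARAMETER_COUNT;
};

// The context owns all pools and sets, plus the two counters.
struct Context {
    std::vector<FakePool> pools;
    std::vector<FakeSet>  sets;
    std::size_t sets_requested = 0; // how many sets the upcoming work needs
    std::size_t sets_used      = 0; // how many sets have been handed out
};

// Record that upcoming dispatches will need n more descriptor sets.
inline void request_descriptor_sets(Context & ctx, std::size_t n) {
    ctx.sets_requested += n;
}

// Grow the pool and set vectors until every requested set exists.
inline void allocate_requested_sets(Context & ctx) {
    while (ctx.sets.size() < ctx.sets_requested) {
        // Open a new pool whenever the existing pools are full.
        if (ctx.sets.size() == ctx.pools.size() * SETS_PER_POOL) {
            ctx.pools.push_back(static_cast<FakePool>(ctx.pools.size()));
        }
        ctx.sets.push_back(static_cast<FakeSet>(ctx.sets.size()));
    }
}

// Hand out the next unused set; any pipeline may use it because the layout is shared.
inline FakeSet take_descriptor_set(Context & ctx) {
    if (ctx.sets_used >= ctx.sets.size()) {
        throw std::out_of_range("descriptor set requested but not allocated");
    }
    return ctx.sets[ctx.sets_used++];
}

// Reset both counters; pools and sets stay allocated for reuse.
inline void reset_for_next_graph(Context & ctx) {
    ctx.sets_requested = 0;
    ctx.sets_used      = 0;
}
```

Because every pipeline shares one layout in this sketch, a set taken from the context is interchangeable across pipelines, which is why a single pair of counters is enough.
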

cmake

cmake: Factor out CPU architecture detection (#13883)

2025-05-29 12:50:25 +02:00

include

ggml : remove ggml_graph_import and ggml_graph_export declarations (ggml/1247)

2025-06-01 13:43:57 +03:00

src

vulkan: Track descriptor pools/sets per-context (#14109)

2025-06-11 07:19:25 +02:00

.gitignore

vulkan : cmake integration (#8119)

2024-07-13 18:12:39 +02:00

CMakeLists.txt

ggml-cpu : split arch-specific implementations (#13892)

2025-06-09 16:47:13 +02:00