0cc4m
5fd89a70ea
Vulkan Optimizations and Fixes (#8959)
* Optimize Vulkan REPEAT performance
* Use Vulkan GLSL fused multiply-add instruction where possible
* Add GGML_VULKAN_PERF option to output performance data per operator
* Rework and fix Vulkan descriptor set and descriptor pool handling
* Fix float32 concat f16 shader validation error
* Add Vulkan GROUP_NORM eps parameter
* Fix validation error with transfer queue memory barrier flags
* Remove trailing whitespace
2024-08-14 18:32:53 +02:00
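
The fused multiply-add item above refers to computing `a * b + c` as one operation with a single rounding step; GLSL exposes this as the built-in `fma(a, b, c)`. The sketch below is not taken from the commit's shaders. It uses C++'s `std::fma` to illustrate the same idea on the CPU, and the helper names `fused_mul_add` and `fma_demo` are hypothetical. The inputs are chosen so that the exact product `1 - 2^-60` cannot be stored in a double, which means a separately rounded multiply would lose the low-order term that the fused form keeps.

```cpp
#include <cmath>

// Hypothetical helper (illustration only): a*b + c with a single rounding.
double fused_mul_add(double a, double b, double c) {
    return std::fma(a, b, c);
}

// Hypothetical demo: (1 + 2^-30) * (1 - 2^-30) - 1.
// The exact product is 1 - 2^-60, which is not representable as a double;
// rounding the product on its own would yield 1.0 and a final result of 0.
// The fused form rounds only once, at the end, so the residual survives.
double fma_demo() {
    const double eps = std::ldexp(1.0, -30);  // 2^-30
    const double a = 1.0 + eps;
    const double b = 1.0 - eps;
    return fused_mul_add(a, b, -1.0);
}
```

Calling `fma_demo()` returns the small negative residual rather than zero. Whether an unfused `a * b + c` expression gives a different answer depends on compiler flags and hardware, since some toolchains contract it into a fused instruction on their own.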