0cc4m
5fd89a70ea
Vulkan Optimizations and Fixes (#8959)
* Optimize Vulkan REPEAT performance
* Use Vulkan GLSL fused multiply-add instruction where possible
* Add GGML_VULKAN_PERF option to output performance data per operator
* Rework and fix Vulkan descriptor set and descriptor pool handling
* Fix float32 concat f16 shader validation error
* Add Vulkan GROUP_NORM eps parameter
* Fix validation error with transfer queue memory barrier flags
* Remove trailing whitespaces
2024-08-14 18:32:53 +02:00