luoyu-intel
d08c20edde
[SYCL] Fix the sub group size of Intel (#8106)
* use warp_size macro for all sycl kernels
* fix mask of permute_sub_group_by_xor
* fix rms_norm with correct warp number
* fix rms_norm_f32/group_norm_f32
* move norm to norm.cpp file
* fix quantize bug
* fix mmvq's batch size
2024-07-02 10:16:00 +08:00
..
2024-06-26 18:33:02 +03:00
2024-07-02 10:16:00 +08:00
2024-06-26 18:33:02 +03:00
2024-07-02 10:16:00 +08:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-07-02 10:16:00 +08:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-06-26 18:33:02 +03:00
2024-07-02 10:16:00 +08:00
2024-06-26 18:33:02 +03:00
2024-07-02 10:16:00 +08:00
2024-07-02 10:16:00 +08:00
2024-07-02 10:16:00 +08:00
2024-07-01 19:39:06 +08:00
2024-07-01 19:39:06 +08:00
2024-07-01 20:39:06 +02:00