ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels (#9217)

* ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels

* added fallback mechanism when the offline re-quantized model is not
optimized for the underlying target.

* fix for build errors

* remove prints from the low-level code

* Rebase to the latest upstream
This commit is contained in:
Charles Xu
2024-09-25 15:12:20 +02:00
committed by GitHub
parent afbbfaa537
commit 1e43630218

File diff suppressed because it is too large Load Diff