[CANN] Adapt to dynamically loadable backends mechanism (#9970)

* [CANN] Adapt to dynamically loadable backends mechanism * Fix the Bug: inference running result is garbled in debug running model for LM models who's type is Q4_0 class * Handle the review comments of this pull request
2025-08-11 11:05:39 -04:00 · 2024-10-22 16:16:01 +08:00
parent 674804a996
commit 6b8447352d
4 changed files with 267 additions and 149 deletions
--- a/ggml/include/ggml-cann.h
+++ b/ggml/include/ggml-cann.h
@@ -34,6 +34,8 @@ extern "C" {
 */
 #define GGML_CANN_MAX_DEVICES 16

+GGML_API ggml_backend_reg_t ggml_backend_cann_reg(void);
+
 /**
 * @brief Initializes the CANN backend for a specified device.
 *