graph : normalize Q, K, V shapes + sync cross attention (#12449)

* graph : normalize Q, K, V shapes and add comments

ggml-ci

* context : synchronize before getting cross attention data

* model : fix command-r attention norm check
Georgi Gerganov
2025-03-18 21:35:19 +02:00
committed by GitHub
parent bb115d2bf7
commit 75422e8bc4
4 changed files with 433 additions and 277 deletions
