Files
llama.cpp/ggml
Francis Couture-Harpin f8c7caeeb7 cuda : implement ssm scan for Mamba2
There is still room for improvement, but it works!

* cuda : adapt Mamba1 ssm scan to shape changes from Mamba2
2025-06-19 01:56:04 -04:00
..
2025-06-19 01:56:04 -04:00
2024-07-13 18:12:39 +02:00