llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-17 13:40:55 -04:00

Files

slaren be55695eff ggml-backend : fix async copy from CPU (#8897)

* ggml-backend : fix async copy from CPU

* cuda : more reliable async copy, fix stream used when the devices are the same

2024-08-07 13:29:02 +02:00

2024-06-26 18:33:02 +03:00

2024-08-06 10:26:46 +03:00

2024-08-07 13:29:02 +02:00

.gitignore

2024-07-13 18:12:39 +02:00

CMakeLists.txt

cann: update cmake (#8765)

2024-07-30 12:37:35 +02:00