This website requires JavaScript.
Explore
Help
Sign In
tqcq
/
llama.cpp
Watch
0
Star
0
Fork
0
You've already forked llama.cpp
mirror of
https://github.com/ggml-org/llama.cpp.git
synced
2025-07-28 13:20:27 -04:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
5c86c9ed3ef1cc7307fdce05f0f0e2e45253cf90
llama.cpp
/
ggml
History
Johannes Gäßler
5c86c9ed3e
CUDA: fix crash on large batch size for MoE models (
#13384
)
2025-05-09 12:14:04 +02:00
..
cmake
scripts : update sync + fix cmake merge
2025-03-27 10:09:29 +02:00
include
CUDA: fix bad asserts for partial offload (
#13337
)
2025-05-06 13:58:51 +02:00
src
CUDA: fix crash on large batch size for MoE models (
#13384
)
2025-05-09 12:14:04 +02:00
.gitignore
…
CMakeLists.txt
whisper: remove MSVC warnings pragmas (whisper/3090)
2025-05-07 17:28:36 +03:00