This website requires JavaScript.
Explore
Help
Sign In
tqcq
/
llama.cpp
Watch
0
Star
0
Fork
0
You've already forked llama.cpp
mirror of
https://github.com/ggml-org/llama.cpp.git
synced
2025-07-26 03:03:25 -04:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
1f0fea70fb761d10e2264cbdcf4852ed32706c89
llama.cpp
/
ggml
/
include
History
Francis Couture-Harpin
1f0fea70fb
llama : initial Mamba-2 support
2024-08-21 18:00:34 -04:00
..
ggml-alloc.h
…
ggml-backend.h
CUDA: fix partial offloading for ne0 % 256 != 0 (
#8572
)
2024-07-18 23:48:47 +02:00
ggml-blas.h
…
ggml-cann.h
[CANN] Add Ascend NPU backend (
#6035
)
2024-07-17 14:23:50 +03:00
ggml-cuda.h
feat: Support Moore Threads GPU (
#8383
)
2024-07-28 01:41:25 +02:00
ggml-kompute.h
…
ggml-metal.h
metal : add abort callback (ggml/905)
2024-08-08 13:19:30 +03:00
ggml-rpc.h
…
ggml-sycl.h
…
ggml-vulkan.h
…
ggml.h
llama : initial Mamba-2 support
2024-08-21 18:00:34 -04:00