This website requires JavaScript.
Explore
Help
Sign In
tqcq
/
llama.cpp
Watch
0
Star
0
Fork
0
You've already forked llama.cpp
mirror of
https://github.com/ggml-org/llama.cpp.git
synced
2025-07-30 22:23:31 -04:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
ffc727203af1061fdeb49efef30f76171722e403
llama.cpp
/
ggml
/
include
History
Johannes Gäßler
2356fb1d53
CUDA: fix bad asserts for partial offload (
#13337
)
2025-05-06 13:58:51 +02:00
..
ggml-alloc.h
…
ggml-backend.h
CUDA: fix logic for clearing padding with -ngl 0 (
#13320
)
2025-05-05 22:32:13 +02:00
ggml-blas.h
…
ggml-cann.h
…
ggml-cpp.h
ggml : fix ggml_gallocr_ptr type (ggml/1205)
2025-05-01 09:58:44 +03:00
ggml-cpu.h
ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs (
#13107
)
2025-04-26 16:05:31 +02:00
ggml-cuda.h
…
ggml-kompute.h
…
ggml-metal.h
…
ggml-opencl.h
…
ggml-opt.h
…
ggml-rpc.h
rpc : do not wait for response when sending RPC_CMD_SET_TENSOR (
#12943
)
2025-04-25 10:08:08 +03:00
ggml-sycl.h
…
ggml-vulkan.h
…
ggml.h
CUDA: fix bad asserts for partial offload (
#13337
)
2025-05-06 13:58:51 +02:00
gguf.h
…