This website requires JavaScript.
Explore
Help
Sign In
tqcq
/
llama.cpp
Watch
0
Star
0
Fork
0
You've already forked llama.cpp
mirror of
https://github.com/ggml-org/llama.cpp.git
synced
2025-08-08 18:04:54 -04:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
01b929491b50071a5d0572235dcf5a449da70aa7
llama.cpp
/
ggml
/
include
History
Aaron Teo
4a9f60c201
ggml-cpu: add nnpa compile flag
...
Signed-off-by: Aaron Teo <
aaron.teo1@ibm.com
>
2025-06-20 21:44:35 +08:00
..
ggml-alloc.h
ggml : upgrade init_tensor API to return a ggml_status (
#11854
)
2025-02-28 14:41:47 +01:00
ggml-backend.h
Add
--no-op-offload
to improve
-ot
pp perf in MoE models like llama4 400B (
#13386
)
2025-05-11 14:18:39 +02:00
ggml-blas.h
…
ggml-cann.h
…
ggml-cpp.h
ggml : fix ggml_gallocr_ptr type (ggml/1205)
2025-05-01 09:58:44 +03:00
ggml-cpu.h
ggml-cpu: add nnpa compile flag
2025-06-20 21:44:35 +08:00
ggml-cuda.h
…
ggml-kompute.h
…
ggml-metal.h
…
ggml-opencl.h
…
ggml-opt.h
mnist: fix segmentation fault (ggml/1227)
2025-05-19 13:29:56 +03:00
ggml-rpc.h
rpc : do not wait for response when sending RPC_CMD_SET_TENSOR (
#12943
)
2025-04-25 10:08:08 +03:00
ggml-sycl.h
…
ggml-vulkan.h
…
ggml.h
ggml : remove ggml_graph_import and ggml_graph_export declarations (ggml/1247)
2025-06-01 13:43:57 +03:00
gguf.h
…