llama.cpp/include at a9aedf46b4930d94cd6b79860af9700a58373023 - llama.cpp - Cat's Mantra

tqcq/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-09 18:40:20 -04:00

Files

History

Sigbjørn Skjæret 0b2703fc57 implement swapped variants (cpu/cuda)

2025-06-22 10:37:25 +05:30

..

ggml-alloc.h

…

ggml-backend.h

Add --no-op-offload to improve -ot pp perf in MoE models like llama4 400B (#13386 )

2025-05-11 14:18:39 +02:00

ggml-blas.h

…

ggml-cann.h

…

ggml-cpp.h

ggml : fix ggml_gallocr_ptr type (ggml/1205)

2025-05-01 09:58:44 +03:00

ggml-cpu.h

ggml: move fp16/bf16 conversion optimizations to CPU backend + export conversion APIs (#13107 )

2025-04-26 16:05:31 +02:00

ggml-cuda.h

…

ggml-kompute.h

…

ggml-metal.h

…

ggml-opencl.h

…

ggml-opt.h

mnist: fix segmentation fault (ggml/1227)

2025-05-19 13:29:56 +03:00

ggml-rpc.h

…

ggml-sycl.h

…

ggml-vulkan.h

…

ggml.h

implement swapped variants (cpu/cuda)

2025-06-22 10:37:25 +05:30

gguf.h

…