llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-26 11:13:53 -04:00

Files

Aman Gupta 0a5a3b5cdf Add Conv2d for CPU (#14388 )

* Conv2D: Add CPU version

* Half decent

* Tiled approach for F32

* remove file

* Fix tests

* Support F16 operations

* add assert about size

* Review: further formatting fixes, add assert and use CPU version of fp32->fp16

2025-06-30 23:57:04 +08:00

ggml-alloc.h

ggml : upgrade init_tensor API to return a ggml_status (#11854 )

2025-02-28 14:41:47 +01:00

ggml-backend.h

vulkan: Add fusion support for RMS_NORM+MUL (#14366 )

2025-06-29 09:43:36 +02:00

ggml-blas.h

…

ggml-cann.h

…

ggml-cpp.h

ggml : fix ggml_gallocr_ptr type (ggml/1205)

2025-05-01 09:58:44 +03:00

ggml-cpu.h

ggml : add ggml_set_rows (#14274 )

2025-06-27 16:41:40 +03:00

ggml-cuda.h

…

ggml-kompute.h

…

ggml-metal.h

repo : update links to new url (#11886 )

2025-02-15 16:40:57 +02:00

ggml-opencl.h

…

ggml-opt.h

mnist: fix segmentation fault (ggml/1227)

2025-05-19 13:29:56 +03:00

ggml-rpc.h

rpc : do not wait for response when sending RPC_CMD_SET_TENSOR (#12943 )

2025-04-25 10:08:08 +03:00

ggml-sycl.h

…

ggml-vulkan.h

…

ggml.h

Add Conv2d for CPU (#14388 )

2025-06-30 23:57:04 +08:00

gguf.h

…