llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-31 06:34:56 -04:00

Files

Diego Devesa 10bce0450f llama : accept a list of devices to use to offload a model (#10497 )

* llama : accept a list of devices to use to offload a model

* accept `--dev none` to completely disable offloading

* fix dev list with dl backends

* rename env parameter to LLAMA_ARG_DEVICE for consistency

2024-11-25 19:30:06 +01:00

llama.h

llama : accept a list of devices to use to offload a model (#10497 )

2024-11-25 19:30:06 +01:00