llama.cpp/common
Diego Devesa ec428b02c3 llama : add --n-cpu-moe option (#15077)
* llama : add --n-cpu-moe option

Keeps the MoE weights of the first N layers in the CPU
2025-08-05 01:05:36 +02:00
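The commit message describes the option only in one line, so the following is a minimal sketch of one way "the first N layers" could be turned into a tensor selection. It is not the implementation from this commit. It assumes that MoE expert tensors are named like `blk.<layer>.ffn_up_exps`, `blk.<layer>.ffn_down_exps`, and `blk.<layer>.ffn_gate_exps`; the helper names `moe_cpu_patterns` and `keep_on_cpu` are hypothetical.

```cpp
#include <regex>
#include <string>
#include <vector>

// Hypothetical helper: build one regex per layer for the first n_layers layers.
// Assumption: expert tensors are named blk.<i>.ffn_{up,down,gate}_exps.
std::vector<std::string> moe_cpu_patterns(int n_layers) {
    std::vector<std::string> patterns;
    for (int i = 0; i < n_layers; ++i) {
        patterns.push_back("blk\\." + std::to_string(i) + "\\.ffn_(up|down|gate)_exps");
    }
    return patterns;
}

// Hypothetical helper: returns true if the tensor name matches any pattern,
// meaning the tensor would be kept in CPU memory under this sketch.
bool keep_on_cpu(const std::string & tensor_name,
                 const std::vector<std::string> & patterns) {
    for (const auto & p : patterns) {
        if (std::regex_search(tensor_name, std::regex(p))) {
            return true;
        }
    }
    return false;
}
```

With `N = 2`, this sketch selects the expert tensors of layers 0 and 1 and leaves attention tensors and the experts of later layers alone. The escaped dots around the layer index keep the pattern for layer 1 from also matching layer 10.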