Merge branch 'master' into compilade/bitnet-ternary

This commit is contained in:
Francis Couture-Harpin
2024-08-22 16:42:24 -04:00
77 changed files with 4681 additions and 2212 deletions

View File

@ -34,7 +34,7 @@ Run the quantized model:
```bash
# start inference on a gguf model
./llama-cli -m ./models/mymodel/ggml-model-Q4_K_M.gguf -n 128
./llama-cli -m ./models/mymodel/ggml-model-Q4_K_M.gguf -cnv -p "You are a helpful assistant"
```
When running the larger models, make sure you have enough disk space to store all the intermediate files.