ggml : quantization refactoring (#3833)

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-17 21:51:27 -04:00

* ggml : factor all quantization code in ggml-quants

ggml-ci

* ggml-quants : fix Zig and Swift builds + quantize tool

ggml-ci

* quantize : --pure option for disabling k-quant mixtures

---------

Co-authored-by: cebtenzzre <cebtenzzre@gmail.com>

Author: Georgi Gerganov
Date: 2023-10-29 18:32:28 +02:00
Committed by: GitHub

parent ff3bad83e2

commit d69d777c02

11 changed files with 2372 additions and 2385 deletions

7280 ggml-quants.c Normal file

File diff suppressed because it is too large
