Files
llama.cpp/examples
Kawrakow 1d0331c12a quantize: options for output and token embedding tensors qtype (#6239)
* quantize: be able to specify the output tensor type

* quantize: be able to specify the token embedding tensor type

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2024-03-22 20:47:14 +02:00
..
2024-03-22 09:36:03 +02:00
2024-03-14 20:29:32 +02:00
2024-02-16 11:31:07 +02:00
2024-02-16 11:31:07 +02:00
2024-02-16 11:31:07 +02:00
2024-02-16 11:31:07 +02:00
2023-03-29 20:21:09 +03:00
2023-08-30 09:29:32 +03:00
2024-03-07 11:41:53 +02:00