llama.cpp/examples at b1806 - llama.cpp - Cat's Mantra

tqcq/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-09 10:32:38 -04:00

Files

History

John d34633d8db clip : support more quantization types (#4846 )

Uses ggml functions instead of hardcoded names and adds support to quantize into the modern Q-K variants.
This is just the bare minimum to get k-types working - a more refined choice of types would be needed to get best quality on low quantizations.

I ran a few tests, it doesn't break anything I could notice and a Q6_K ViT works almost as well as Q8_0 but 3 times the inference speed.

2024-01-10 15:37:09 +02:00

..

ggml : change ggml_scale to take a float instead of tensor (#4573 )

2023-12-21 23:20:49 +02:00

examples : add passkey test (#3856 )

2024-01-08 11:14:04 +02:00

…

…

…

ggml : add ggml_row_size() (fixes llama out of space) (#4461 )

2023-12-14 14:13:33 +02:00

convert-llama2c-to-ggml

ggml : remove n_dims from ggml_tensor (#4469 )

2023-12-14 16:52:08 +01:00

build : link against build info instead of compiling against it (#3879 )

2023-11-02 08:50:16 +02:00

ggml : change ggml_scale to take a float instead of tensor (#4573 )

2023-12-21 23:20:49 +02:00

finetune : remove unused includes (#4756 )

2024-01-04 21:45:37 +02:00

gguf : simplify example dependencies

2023-12-21 23:08:14 +02:00

…

…

llama-bench : add no-kv-offload parameter (#4812 )

2024-01-07 17:59:01 +01:00

llama.swiftui : update readme

2024-01-08 15:57:36 +02:00

clip : support more quantization types (#4846 )

2024-01-10 15:37:09 +02:00

…

lookup : add prompt lookup decoding example (#4484 )

2023-12-22 18:05:56 +02:00

main : add self-extend support (#4815 )

2024-01-08 11:18:32 +02:00

main-cmake-pkg : fix build issue (#4665 )

2023-12-29 16:18:20 +02:00

…

…

examples : add passkey test (#3856 )

2024-01-08 11:14:04 +02:00

…

…

…

save-load-state

…

server : update readme about token probs (#4777 )

2024-01-09 12:02:05 +02:00

…

…

…

train-text-from-scratch

ggml : change ggml_scale to take a float instead of tensor (#4573 )

2023-12-21 23:20:49 +02:00

alpaca.sh

…

base-translate.sh

examples : improve base-translate.sh script (#4783 )

2024-01-06 11:40:24 +02:00

chat-13B.bat

…

chat-13B.sh

…

chat-persistent.sh

…

chat-vicuna.sh

…

chat.sh

…

CMakeLists.txt

examples : add passkey test (#3856 )

2024-01-08 11:14:04 +02:00

gpt4all.sh

…

json-schema-to-grammar.py

…

llama2-13b.sh

…

llama2.sh

…

llama.vim

…

llm.vim

…

make-ggml.py

…

Miku.sh

…

reason-act.sh

…

server-llama2-13B.sh

…