tqcq / llama.cpp
Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2025-09-01 12:52:17 -04:00
Commit: cd108e641dbdedd8c5641c4cec1762f751f38136
Path: llama.cpp / tests
Latest commit: 8f900abfc0 "CUDA: faster softmax via shared memory + fp16 math (#4742)" by Johannes Gäßler, 2024-01-09 08:58:55 +01:00
File | Last commit | Date
CMakeLists.txt | cmake : fix ld warning duplicate libraries libllama.a (#4671) | 2023-12-29 16:39:15 +02:00
test-backend-ops.cpp | CUDA: faster softmax via shared memory + fp16 math (#4742) | 2024-01-09 08:58:55 +01:00
test-c.c | …
test-double-float.cpp | …
test-grad0.cpp | cuda : improve cuda pool efficiency using virtual memory (#4606) | 2023-12-24 14:34:22 +01:00
test-grammar-parser.cpp | …
test-llama-grammar.cpp | …
test-opt.cpp | …
test-quantize-fns.cpp | SOTA 2-bit quants (#4773) | 2024-01-08 16:02:32 +01:00
test-quantize-perf.cpp | ggml : use ggml_row_size where possible (#4472) | 2023-12-14 20:05:21 +01:00
test-rope.cpp | …
test-sampling.cpp | …
test-tokenizer-0-falcon.cpp | …
test-tokenizer-0-falcon.py | ci : add flake8 to github actions (python linting) (#4129) | 2023-11-20 11:35:47 +01:00
test-tokenizer-0-llama.cpp | …
test-tokenizer-0-llama.py | ci : add flake8 to github actions (python linting) (#4129) | 2023-11-20 11:35:47 +01:00
test-tokenizer-1-bpe.cpp | …
test-tokenizer-1-llama.cpp | …