tqcq / llama.cpp
Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2025-09-01 12:52:17 -04:00
Commit: cd108e641dbdedd8c5641c4cec1762f751f38136
Path: llama.cpp / tests
Latest commit: 8f900abfc0 "CUDA: faster softmax via shared memory + fp16 math (#4742)" by Johannes Gäßler, 2024-01-09 08:58:55 +01:00
File | Last commit | Date
CMakeLists.txt | cmake : fix ld warning duplicate libraries libllama.a (#4671) | 2023-12-29 16:39:15 +02:00
test-backend-ops.cpp | CUDA: faster softmax via shared memory + fp16 math (#4742) | 2024-01-09 08:58:55 +01:00
test-c.c | …
test-double-float.cpp | …
test-grad0.cpp | cuda : improve cuda pool efficiency using virtual memory (#4606) | 2023-12-24 14:34:22 +01:00
test-grammar-parser.cpp | …
test-llama-grammar.cpp | …
test-opt.cpp | …
test-quantize-fns.cpp | SOTA 2-bit quants (#4773) | 2024-01-08 16:02:32 +01:00
test-quantize-perf.cpp | ggml : use ggml_row_size where possible (#4472) | 2023-12-14 20:05:21 +01:00
test-rope.cpp | …
test-sampling.cpp | …
test-tokenizer-0-falcon.cpp | …
test-tokenizer-0-falcon.py | ci : add flake8 to github actions (python linting) (#4129) | 2023-11-20 11:35:47 +01:00
test-tokenizer-0-llama.cpp | …
test-tokenizer-0-llama.py | ci : add flake8 to github actions (python linting) (#4129) | 2023-11-20 11:35:47 +01:00
test-tokenizer-1-bpe.cpp | …
test-tokenizer-1-llama.cpp | …