llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-14 20:29:41 -04:00

Files

Jeff Bolz e592be1575 vulkan: fix rms_norm+mul fusion (#14545 )

The fused operation was grabbing the epsilon value from the wrong place.

Add an env var to disable fusion.

Add some missing checks for supported shapes/types.

Handle fused rms_norm+mul in check_results.

2025-07-06 10:08:16 +02:00

.gitignore

…

CMakeLists.txt

…

get-model.cpp

…

get-model.h

…

run-json-schema-to-grammar.mjs

…

test-arg-parser.cpp

…

test-autorelease.cpp

…

test-backend-ops.cpp

…

test-barrier.cpp

…

test-c.c

…

test-chat-parser.cpp

…

test-chat-template.cpp

…

test-chat.cpp

…

test-double-float.cpp

…

test-gbnf-validator.cpp

cmake : do not include ./src as public for libllama (#13062 )

2025-04-24 16:00:10 +03:00

test-gguf.cpp

…

test-grammar-integration.cpp

…

test-grammar-llguidance.cpp

…

test-grammar-parser.cpp

…

test-json-partial.cpp

…

test-json-schema-to-grammar.cpp

…

test-llama-grammar.cpp

…

test-log.cpp

…

test-lora-conversion-inference.sh

…

test-model-load-cancel.cpp

…

test-mtmd-c-api.c

…

test-opt.cpp

…

test-quantize-fns.cpp

…

test-quantize-perf.cpp

…

test-quantize-stats.cpp

…

test-regex-partial.cpp

…

test-rope.cpp

…

test-sampling.cpp

…

test-thread-safety.cpp

…

test-tokenizer-0.cpp

…

test-tokenizer-0.py

…

test-tokenizer-0.sh

…

test-tokenizer-1-bpe.cpp

…

test-tokenizer-1-spm.cpp

…

test-tokenizer-random.py

…

test-tokenizers-repo.sh

…