llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-06-26 11:45:21 +00:00

master

bf5bcd0b85 · docs: update s390x documentation + add faq (#14389) · Updated 2025-06-26 10:41:41 +00:00

gg/context-fix-enc-attn-type 29acf2cf05 · context : move the change to llama_context::encode() · Updated 2025-03-18 09:55:19 +00:00 tqcq	848 2	ZIP TAR.GZ
0cc4m/vulkan-suballoc-1gb 90f17bba01 · Vulkan: Default to 1GB allocations instead of 4GB to avoid fragmentation and driver issues · Updated 2025-03-17 19:41:11 +00:00 tqcq	853 1	ZIP TAR.GZ
jg/cuda-fa-np-runtime f6711cef44 · CUDA: determine FA parallel blocks at runtime · Updated 2025-03-16 13:36:57 +00:00 tqcq	916 1	ZIP TAR.GZ
gg/hparams-swa-rope c4aca65582 · hparams : add SWA rope parameters · Updated 2025-03-13 17:26:09 +00:00 tqcq	875 1	ZIP TAR.GZ
gg/swa-fix-kv-shift 21fe0ce4eb · hparams : add comment [no ci] · Updated 2025-03-13 15:56:38 +00:00 tqcq	876 2	ZIP TAR.GZ
gg/infill-better-stop ed58975f51 · server : improve infill stop criteria · Updated 2025-03-12 13:28:48 +00:00 tqcq	887 1	ZIP TAR.GZ
0cc4m/vulkan-print-coopmat-shapes 87dae2fd15 · Vulkan: Print coopmat shapes, then exit · Updated 2025-03-09 10:53:55 +00:00 tqcq	901 1	ZIP TAR.GZ
0cc4m/vulkan-device-architecture 25840747e6 · Vulkan: Add device architecture enum and logic to recognize AMD generations · Updated 2025-03-08 08:04:45 +00:00 tqcq	1081 2	ZIP TAR.GZ
gg/server-infill-end-on-nl c75753a01b · server : infill gen ends on new line · Updated 2025-03-07 15:19:55 +00:00 tqcq	904 1	ZIP TAR.GZ
gg/ci-fix-save-load aefa65e442 · ci : fix save-load test invokations · Updated 2025-03-07 10:17:33 +00:00 tqcq	909 1	ZIP TAR.GZ
gg/clang-tidy-disable-bugprone aae2903e0b · clang-tidy : disable bugprone-branch-clone · Updated 2025-03-07 09:36:55 +00:00 tqcq	910 1	ZIP TAR.GZ
gg/llama-kv-cache 624f7bd03b · graph : add comments · Updated 2025-02-28 19:13:08 +00:00 tqcq	974 95	ZIP TAR.GZ
gg/speculative-update 0f2bf55502 · speculative : do not discard the last drafted token · Updated 2025-02-19 07:21:39 +00:00 tqcq	1017 2	ZIP TAR.GZ
xsn/ci_legacy_gg 8654805027 · docker : publish to both ggerganov and ggml-org · Updated 2025-02-15 14:18:04 +00:00 tqcq	1058 1	ZIP TAR.GZ
revert-11820-vers_fix f30aca84b2 · Revert "HIP: Switch to std::vector in rocblas version check (#11820)" · Updated 2025-02-12 18:22:04 +00:00 tqcq	1062 1	ZIP TAR.GZ
sl/more-imatrix-nan-fixes cfb0ae7e4c · ggml : fix more imatrix nan cases · Updated 2025-02-09 17:15:02 +00:00 tqcq	1081 1	ZIP TAR.GZ
gg/server-logs d86e23101e · server : minor log updates · Updated 2025-02-08 14:23:37 +00:00 tqcq	1088 1	ZIP TAR.GZ
fall-back-to-jinja e00c9d1c5e · Update examples/server/server.cpp · Updated 2025-02-06 07:55:41 +00:00 tqcq	1114 3	ZIP TAR.GZ
gg/llama-add-log 3b6a0a817a · llama : add log about loading model tensors · Updated 2025-02-06 07:24:07 +00:00 tqcq	1109 1	ZIP TAR.GZ
podman 947158ee52 · Specify podman works in Container documentation · Updated 2025-02-05 13:47:21 +00:00 tqcq	1114 1	ZIP TAR.GZ

... 2 3 4 5 6 ...

Default Branch

Branches