Default Branch

bf5bcd0b85 · docs: update s390x documentation + add faq (#14389) · Updated 2025-06-26 10:41:41 +00:00

Branches

29acf2cf05 · context : move the change to llama_context::encode() · Updated 2025-03-18 09:55:19 +00:00 · tqcq · 848 behind, 2 ahead

90f17bba01 · Vulkan: Default to 1GB allocations instead of 4GB to avoid fragmentation and driver issues · Updated 2025-03-17 19:41:11 +00:00 · tqcq · 853 behind, 1 ahead

f6711cef44 · CUDA: determine FA parallel blocks at runtime · Updated 2025-03-16 13:36:57 +00:00 · tqcq · 916 behind, 1 ahead

c4aca65582 · hparams : add SWA rope parameters · Updated 2025-03-13 17:26:09 +00:00 · tqcq · 875 behind, 1 ahead

21fe0ce4eb · hparams : add comment [no ci] · Updated 2025-03-13 15:56:38 +00:00 · tqcq · 876 behind, 2 ahead

ed58975f51 · server : improve infill stop criteria · Updated 2025-03-12 13:28:48 +00:00 · tqcq · 887 behind, 1 ahead

87dae2fd15 · Vulkan: Print coopmat shapes, then exit · Updated 2025-03-09 10:53:55 +00:00 · tqcq · 901 behind, 1 ahead

25840747e6 · Vulkan: Add device architecture enum and logic to recognize AMD generations · Updated 2025-03-08 08:04:45 +00:00 · tqcq · 1081 behind, 2 ahead

c75753a01b · server : infill gen ends on new line · Updated 2025-03-07 15:19:55 +00:00 · tqcq · 904 behind, 1 ahead

aefa65e442 · ci : fix save-load test invokations · Updated 2025-03-07 10:17:33 +00:00 · tqcq · 909 behind, 1 ahead

aae2903e0b · clang-tidy : disable bugprone-branch-clone · Updated 2025-03-07 09:36:55 +00:00 · tqcq · 910 behind, 1 ahead

624f7bd03b · graph : add comments · Updated 2025-02-28 19:13:08 +00:00 · tqcq · 974 behind, 95 ahead

0f2bf55502 · speculative : do not discard the last drafted token · Updated 2025-02-19 07:21:39 +00:00 · tqcq · 1017 behind, 2 ahead

8654805027 · docker : publish to both ggerganov and ggml-org · Updated 2025-02-15 14:18:04 +00:00 · tqcq · 1058 behind, 1 ahead

f30aca84b2 · Revert "HIP: Switch to std::vector in rocblas version check (#11820)" · Updated 2025-02-12 18:22:04 +00:00 · tqcq · 1062 behind, 1 ahead

cfb0ae7e4c · ggml : fix more imatrix nan cases · Updated 2025-02-09 17:15:02 +00:00 · tqcq · 1081 behind, 1 ahead

d86e23101e · server : minor log updates · Updated 2025-02-08 14:23:37 +00:00 · tqcq · 1088 behind, 1 ahead

e00c9d1c5e · Update examples/server/server.cpp · Updated 2025-02-06 07:55:41 +00:00 · tqcq · 1114 behind, 3 ahead

3b6a0a817a · llama : add log about loading model tensors · Updated 2025-02-06 07:24:07 +00:00 · tqcq · 1109 behind, 1 ahead

947158ee52 · Specify podman works in Container documentation · Updated 2025-02-05 13:47:21 +00:00 · tqcq · 1114 behind, 1 ahead