Logo
Explore Help
Sign In
tqcq/llama.cpp
0
0
Fork 0
You've already forked llama.cpp
mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-16 13:12:51 -04:00
Code Issues Packages Projects Releases Wiki Activity
Files
148844fe97fff4c1563a3111bf238ba4dd22ef56
llama.cpp/src
History
Georgi Gerganov cad341d889 metal : reduce command encoding overhead (#9698)
* metal : reduce command encoding overhead

ggml-ci

* metal : add comments
2024-10-01 16:00:25 +03:00
..
CMakeLists.txt
llama : move vocab, grammar and sampling into separate files (#8508)
2024-07-23 13:10:17 +03:00
llama-grammar.cpp
llama : refactor sampling v2 (#9294)
2024-09-07 15:16:19 +03:00
llama-grammar.h
llama : refactor sampling v2 (#9294)
2024-09-07 15:16:19 +03:00
llama-impl.h
log : add CONT level for continuing previous log entry (#9610)
2024-09-24 10:15:35 +03:00
llama-sampling.cpp
sampling : avoid expensive softmax during greedy sampling (#9605)
2024-09-24 09:03:17 +03:00
llama-sampling.h
llama : refactor samplers internal implementation (#9370)
2024-09-08 15:52:07 +02:00
llama-vocab.cpp
llama : add reranking support (#9510)
2024-09-28 17:42:03 +03:00
llama-vocab.h
vocab : refactor tokenizer to reduce init overhead (#9449)
2024-09-28 15:10:58 +03:00
llama.cpp
metal : reduce command encoding overhead (#9698)
2024-10-01 16:00:25 +03:00
unicode-data.cpp
…
unicode-data.h
…
unicode.cpp
unicode : add <algorithm> (#9508)
2024-09-17 09:51:15 +03:00
unicode.h
llama : move vocab, grammar and sampling into separate files (#8508)
2024-07-23 13:10:17 +03:00
Powered by Gitea Version: 1.24.4 Page: 1657ms Template: 22ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API