llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-14 20:29:41 -04:00

Files

matiaslin faac0bae26 common : ensure llama_batch size does not exceed max size (#9668)

A crash was observed when the number of tokens added to a batch exceeded
the llama_batch size. An assertion was added to llama_batch_add to protect
against llama_batch size overflow.

2024-09-29 15:25:00 +03:00
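The guard described in the commit message above can be illustrated with a minimal, self-contained sketch. It is not the code from the commit: `toy_batch`, `toy_batch_add`, and `toy_batch_try_add` are hypothetical names, and the struct is a simplified stand-in rather than the real `llama_batch` layout. The sketch only shows the general idea of checking the fill count against a fixed capacity before writing.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a fixed-capacity token batch.
// This is NOT the real llama_batch structure.
struct toy_batch {
    std::vector<int32_t> tokens;   // storage allocated once, up front
    int32_t              n_tokens; // number of slots currently in use

    explicit toy_batch(int32_t capacity)
        : tokens(static_cast<std::size_t>(capacity)), n_tokens(0) {}

    int32_t capacity() const { return static_cast<int32_t>(tokens.size()); }
};

// Assertion-style guard: abort (in builds with assertions enabled)
// instead of writing past the end of the preallocated storage.
void toy_batch_add(toy_batch & batch, int32_t token) {
    assert(batch.n_tokens < batch.capacity() && "toy_batch size exceeded");
    batch.tokens[static_cast<std::size_t>(batch.n_tokens)] = token;
    batch.n_tokens++;
}

// Non-aborting variant: report failure so the caller can flush the
// batch and retry. Hypothetical; shown only for comparison.
bool toy_batch_try_add(toy_batch & batch, int32_t token) {
    if (batch.n_tokens >= batch.capacity()) {
        return false; // batch is full, nothing was written
    }
    batch.tokens[static_cast<std::size_t>(batch.n_tokens)] = token;
    batch.n_tokens++;
    return true;
}
```

An assertion turns silent memory corruption into an immediate, diagnosable failure at the point of misuse. A caller that expects to hit the limit during normal operation would likely prefer a checked variant such as the hypothetical `toy_batch_try_add`.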

cmake

…

arg.cpp

llama : add reranking support (#9510)

2024-09-28 17:42:03 +03:00

arg.h

common : move arg parser code to arg.cpp (#9388)

2024-09-09 23:36:09 +02:00

base64.hpp

…

build-info.cpp.in

…

CMakeLists.txt

common : reimplement logging (#9418)

2024-09-15 20:46:12 +03:00

common.cpp

common : ensure llama_batch size does not exceed max size (#9668)

2024-09-29 15:25:00 +03:00

common.h

llama : add reranking support (#9510)

2024-09-28 17:42:03 +03:00

console.cpp

…

console.h

…

json-schema-to-grammar.cpp

…

json-schema-to-grammar.h

…

json.hpp

…

log.cpp

log : add CONT level for continuing previous log entry (#9610)

2024-09-24 10:15:35 +03:00

log.h

log : add CONT level for continuing previous log entry (#9610)

2024-09-24 10:15:35 +03:00

ngram-cache.cpp

common : reimplement logging (#9418)

2024-09-15 20:46:12 +03:00

ngram-cache.h

…

sampling.cpp

sampling : avoid expensive softmax during greedy sampling (#9605)

2024-09-24 09:03:17 +03:00

sampling.h

llama : move random seed generation to the samplers (#9398)

2024-09-10 18:04:25 +02:00

stb_image.h

…

train.cpp

common : reimplement logging (#9418)

2024-09-15 20:46:12 +03:00

train.h

…