llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-07-29 13:43:38 -04:00

Files

stduhpf e0324285a5 speculative : threading options (#4959)

* speculative: expose draft threading

* fix usage format

* accept -td and -tbd args

* speculative: revert default behavior when -td is unspecified

* fix trailing whitespace

2024-01-16 13:04:32 +02:00

base64.hpp

llava : expose as a shared library for downstream projects (#3613)

2023-11-07 00:36:23 +03:00

build-info.cpp.in

build : link against build info instead of compiling against it (#3879)

2023-11-02 08:50:16 +02:00

CMakeLists.txt

cmake : fix ld warning duplicate libraries libllama.a (#4671)

2023-12-29 16:39:15 +02:00

common.cpp

speculative : threading options (#4959)

2024-01-16 13:04:32 +02:00

common.h

speculative : threading options (#4959)

2024-01-16 13:04:32 +02:00

console.cpp

check C++ code with -Wmissing-declarations (#3184)

2023-09-15 15:38:27 -04:00

console.h

gguf : new file format with flexible meta data (beta) (#2398)

2023-08-21 23:07:43 +03:00

grammar-parser.cpp

grammar-parser : fix typo (#4318)

2023-12-04 09:57:35 +02:00

grammar-parser.h

gguf : new file format with flexible meta data (beta) (#2398)

2023-08-21 23:07:43 +03:00

log.h

english : use typos to fix comments and logs (#4354)

2023-12-12 11:53:36 +02:00

sampling.cpp

llama : apply classifier-free guidance to logits directly (#4951)

2024-01-15 15:06:52 +02:00

sampling.h

server : allow to specify custom prompt for penalty calculation (#3727)

2023-12-23 11:31:49 +02:00

stb_image.h

examples: support LLaVA v1.5 (multimodal model) (#3436)

2023-10-12 18:23:18 +03:00

train.cpp

train : fix typo in overlapping-samples help msg (#4758)

2024-01-03 19:53:40 +02:00

train.h

sync : ggml (backend v2) (#3912)

2023-11-13 14:16:23 +02:00