llama.cpp/examples at c263ca767b080fce9bf75accea41026b6e7542b9 - llama.cpp - Cat's Mantra

tqcq/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-27 18:49:40 -04:00

Files

History

Xuan Son Nguyen 958367bf53 server : refactor slot input data, move tokenizer to HTTP thread (#10023 )

* server : refactor slot input data, move tokenizer to HTTP thread

* move prompt_tokens.empty() check

* fix incorrect if branch

* fix infinite generation loop

* bring back infill validation

* add infill test

* try fixing format_infill

* fix test

* remove redundant code

* rename completion to inference

* update docs

* use llama_tokens everywhere

2024-10-24 21:51:22 +02:00

..

Threadpool: take 2 (#8672 )

2024-08-30 01:20:53 +02:00

common : use common_ prefix for common library functions (#9805 )

2024-10-10 22:57:42 +02:00

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )

2024-10-18 23:18:01 +02:00

…

convert-llama2c-to-ggml

common : use common_ prefix for common library functions (#9805 )

2024-10-10 22:57:42 +02:00

cvector-generator

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )

2024-10-18 23:18:01 +02:00

deprecation-warning

…

common : use common_ prefix for common library functions (#9805 )

2024-10-10 22:57:42 +02:00

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )

2024-10-18 23:18:01 +02:00

common : use common_ prefix for common library functions (#9805 )

2024-10-10 22:57:42 +02:00

…

common : use common_ prefix for common library functions (#9805 )

2024-10-10 22:57:42 +02:00

…

…

…

common : use common_ prefix for common library functions (#9805 )

2024-10-10 22:57:42 +02:00

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )

2024-10-18 23:18:01 +02:00

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )

2024-10-18 23:18:01 +02:00

…

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )

2024-10-18 23:18:01 +02:00

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )

2024-10-18 23:18:01 +02:00

llama : default sampling changes + greedy update (#9897 )

2024-10-21 09:46:40 +03:00

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )

2024-10-18 23:18:01 +02:00

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )

2024-10-18 23:18:01 +02:00

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )

2024-10-18 23:18:01 +02:00

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )

2024-10-18 23:18:01 +02:00

…

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )

2024-10-18 23:18:01 +02:00

common : use common_ prefix for common library functions (#9805 )

2024-10-10 22:57:42 +02:00

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )

2024-10-18 23:18:01 +02:00

…

…

common : use common_ prefix for common library functions (#9805 )

2024-10-10 22:57:42 +02:00

…

save-load-state

llama : default sampling changes + greedy update (#9897 )

2024-10-21 09:46:40 +03:00

server : refactor slot input data, move tokenizer to HTTP thread (#10023 )

2024-10-24 21:51:22 +02:00

llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )

2024-10-18 23:18:01 +02:00

llama : default sampling changes + greedy update (#9897 )

2024-10-21 09:46:40 +03:00

…

common : use common_ prefix for common library functions (#9805 )

2024-10-10 22:57:42 +02:00

base-translate.sh

…

chat-13B.bat

Create chat-13B.bat (#592 )

2023-03-29 20:21:09 +03:00

chat-13B.sh

…

chat-persistent.sh

…

chat-vicuna.sh

…

chat.sh

…

CMakeLists.txt

…

convert_legacy_llama.py

…

json_schema_pydantic_example.py

…

json_schema_to_grammar.py

grammar : fix JSON Schema for string regex with top-level alt. (#9903 )

2024-10-16 19:03:24 +03:00

llama.vim

llama.vim : bump generation time limit to 3s [no ci]

2024-10-23 17:16:56 +03:00

llm.vim

…

Miku.sh

…

pydantic_models_to_grammar_examples.py

…

pydantic_models_to_grammar.py

…

reason-act.sh

build: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )

2024-06-13 00:41:52 +01:00

regex_to_grammar.py

…

server_embd.py

…

server-llama2-13B.sh

…

ts-type-to-grammar.sh

…