llama.cpp/examples at 22f281aa16f44d8f6ec2c180a0685ff27e04e714 - llama.cpp - Cat's Mantra

tqcq/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-09-01 12:52:17 -04:00

Files

History

M-A 22f281aa16 examples : Rewrite pydantic_models_to_grammar_examples.py (#8493 )

Changes:

- Move each example into its own function. This makes the code much
  easier to read and understand.
- Make the program easy to only run one test by commenting out function
  calls in main().
- Make the output easy to parse by indenting the output for each example.
- Add shebang and +x bit to make it clear it's an executable.
- Make the host configurable via --host with a default 127.0.0.1:8080.
- Make the code look in the tools list to call the registered tool,
  instead of hardcoding the returned values. This makes the code more
  copy-pastable.
- Add error checking, so that the program exits 1 if the LLM didn't
  returned expected values. It's super useful to check for correctness.

Testing:

- Tested with Mistral-7B-Instruct-v0.3 in F16 and Q5_K_M and
  Meta-Llama-3-8B-Instruct in F16 and Q5_K_M.
  - I did not observe a failure even once in Mistral-7B-Instruct-v0.3.
  - Llama-3 failed about a third of the time in example_concurrent: it
    only returned one call instead of 3. Even for F16.

Potential follow ups:

- Do not fix the prompt encoding yet. Surprisingly it mostly works even
  if the prompt encoding is not model optimized.
- Add chained answer and response.

Test only change.

2024-07-20 22:09:17 -04:00

..

…

batched: fix n_predict parameter (#8527 )

2024-07-17 10:34:28 +03:00

…

…

…

convert-llama2c-to-ggml

build: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )

2024-06-13 00:41:52 +01:00

cvector-generator

…

deprecation-warning

Deprecation warning to assist with migration to new binary names (#8283 )

2024-07-09 11:54:43 -04:00

…

examples : sprintf -> snprintf (#8434 )

2024-07-12 10:46:14 +03:00

export-lora : handle help argument (#8497 )

2024-07-16 10:04:45 +03:00

py : type-check all Python scripts with Pyright (#8341 )

2024-07-07 15:04:39 -04:00

llama : return nullptr from llama_grammar_init (#8093 )

2024-06-25 15:07:28 -04:00

gguf : handle null name during init (#8587 )

2024-07-20 17:15:42 +03:00

gguf-hash : update clib.json to point to original xxhash repo (#8491 )

2024-07-16 10:14:16 +03:00

…

…

…

infill : assert prefix/suffix tokens + remove old space logic (#8351 )

2024-07-08 09:34:35 +03:00

…

[CANN] Add Ascend NPU backend (#6035 )

2024-07-17 14:23:50 +03:00

…

llama.swiftui: fix end of generation bug (#8268 )

2024-07-20 16:09:37 +03:00

[CANN] Add Ascend NPU backend (#6035 )

2024-07-17 14:23:50 +03:00

…

lookup: fibonacci hashing, fix crashes (#8548 )

2024-07-17 23:35:44 +02:00

main : print error on empty input (#8456 )

2024-07-12 14:48:04 +03:00

Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (#8258 )

2024-07-02 12:18:10 -04:00

…

…

…

llama : valign + remove unused ftype (#8502 )

2024-07-16 10:00:30 +03:00

ggml : minor naming changes (#8433 )

2024-07-12 10:46:02 +03:00

llama : allow pooled embeddings on any model (#7477 )

2024-06-21 08:38:22 +03:00

…

save-load-state

…

server: use relative routes for static files in new UI (#8552 )

2024-07-18 12:43:49 +02:00

…

…

…

tokenize : add --no-parse-special option (#8423 )

2024-07-11 10:41:48 +03:00

train-text-from-scratch

py : type-check all Python scripts with Pyright (#8341 )

2024-07-07 15:04:39 -04:00

base-translate.sh

…

chat-13B.bat

…

chat-13B.sh

…

chat-persistent.sh

…

chat-vicuna.sh

…

chat.sh

…

CMakeLists.txt

…

convert_legacy_llama.py

convert-*.py: GGUF Naming Convention Refactor and Metadata Override Refactor (#7499 )

2024-07-18 20:40:15 +10:00

json_schema_pydantic_example.py

py : type-check all Python scripts with Pyright (#8341 )

2024-07-07 15:04:39 -04:00

json_schema_to_grammar.py

py : type-check all Python scripts with Pyright (#8341 )

2024-07-07 15:04:39 -04:00

llama.vim

…

llm.vim

…

Miku.sh

…

pydantic_models_to_grammar_examples.py

examples : Rewrite pydantic_models_to_grammar_examples.py (#8493 )

2024-07-20 22:09:17 -04:00

pydantic_models_to_grammar.py

pydantic : replace uses of __annotations__ with get_type_hints (#8474 )

2024-07-14 19:51:21 -04:00

reason-act.sh

…

regex_to_grammar.py

…

server_embd.py

py : type-check all Python scripts with Pyright (#8341 )

2024-07-07 15:04:39 -04:00

server-llama2-13B.sh

…

ts-type-to-grammar.sh

…