llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-01 15:09:32 -04:00

Files

Xuan-Son Nguyen d2b2031e5f llama : (mrope) allow using normal 1D position for text token (#13138)

* llama : (mrope) use normal position for text token

* rm n_pos_per_embd from llm_graph_input_attn_temp

2025-04-28 14:20:56 +02:00

batched

…

batched-bench

…

batched.swift

…

convert-llama2c-to-ggml

…

cvector-generator

…

deprecation-warning

…

embedding

…

eval-callback

…

export-lora

…

gen-docs

…

gguf

…

gguf-hash

…

gguf-split

…

gritlm

…

imatrix

…

infill

…

jeopardy

…

llama-bench

…

llama.android

…

llama.swiftui

…

llava

…

lookahead

…

lookup

…

main

…

parallel

…

passkey

…

perplexity

…

quantize

…

retrieval

…

rpc

…

run

contrib: support modelscope community (#12664)

2025-04-11 14:01:56 +02:00

save-load-state

…

server

…

simple

…

simple-chat

…

simple-cmake-pkg

…

speculative

…

speculative-simple

…

sycl

…

tokenize

…

tts

…

chat-13B.bat

…

chat-13B.sh

…

chat-persistent.sh

…

chat-vicuna.sh

…

chat.sh

…

CMakeLists.txt

…

convert_legacy_llama.py

…

json_schema_pydantic_example.py

…

json_schema_to_grammar.py

…

llama.vim

…

llm.vim

…

Miku.sh

…

pydantic_models_to_grammar_examples.py

…

pydantic_models_to_grammar.py

…

reason-act.sh

…

regex_to_grammar.py

…

server_embd.py

…

server-llama2-13B.sh

…

ts-type-to-grammar.sh

…