llama.cpp

tqcq/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-15 04:33:06 -04:00

Files

History

Douglas Hanley 339bd0268c model : support Qwen3-Embedding (#15023 )

2025-08-02 10:44:50 +02:00

2025-07-16 00:04:42 +02:00

__init__.py

2024-07-18 20:40:15 +10:00

constants.py

2025-08-01 15:31:12 +02:00

gguf_reader.py

2025-05-21 16:33:54 +02:00

gguf_writer.py

2025-07-31 19:49:09 +08:00

gguf.py

…

lazy.py

2025-04-08 09:03:07 +02:00

metadata.py

2025-07-22 19:29:43 +03:00

py.typed

…

quants.py

2024-09-05 21:48:47 -04:00

tensor_mapping.py

2025-08-02 10:44:50 +02:00

utility.py

2025-05-28 23:50:20 +02:00

vocab.py

2025-07-28 15:01:48 +02:00