llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-06-27 12:05:03 +00:00

Files

Đinh Trọng Huy ad590be98c model : add NeoBERT (#14164)

* convert neobert model to gguf

* add inference graph

* fix flake8 lint

* followed reviewer suggestions

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* follow reviewers suggestions

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* override NeoBERT feed-forward length

---------

Co-authored-by: dinhhuy <huy.dinh@brains-tech.co.jp>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2025-06-16 14:53:41 +02:00

scripts

gguf-py : add support for sub_type (in arrays) in GGUFWriter add_key_value method (#13561)

2025-05-29 15:36:05 +02:00

__init__.py

convert-*.py: GGUF Naming Convention Refactor and Metadata Override Refactor (#7499)

2024-07-18 20:40:15 +10:00

constants.py

model : add NeoBERT (#14164)

2025-06-16 14:53:41 +02:00

gguf_reader.py

gguf-py : display the invalid gguf type (#13687)