|
bc098c3cf0
|
minja: sync (qwen3) (#13573)
* minja: sync f06140fa52
- https://github.com/google/minja/pull/67 (@grf53)
- https://github.com/google/minja/pull/66 (@taha-yassine)
- https://github.com/google/minja/pull/63 (@grf53)
- https://github.com/google/minja/pull/58
---------
Co-authored-by: ochafik <ochafik@google.com>
|
2025-05-15 23:29:10 +01:00 |
|
|
7a84777f42
|
sync: minja (#12739)
* sync: minja
https://github.com/google/minja/pull/57
* fix json include
|
2025-04-04 21:16:39 +01:00 |
|
|
5f696e88e0
|
sync : minja (inclusionAI/Ling) and update tests (#12699)
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
|
2025-04-03 13:51:35 +02:00 |
|
|
a6f32f0b34
|
Fix clang warning in gguf_check_reserved_keys (#12686)
* Fix clang warning in gguf_check_reserved_keys
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* Fix typo
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
---------
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
|
2025-04-01 13:12:53 +02:00 |
|
|
7cf64f6bee
|
sync: minja - support QwQ-32B (#12235)
8a76f7815e
|
2025-03-07 09:33:37 +00:00 |
|
|
63e489c025
|
tool-call: refactor common chat / tool-call api (+ tests / fixes) (#11900)
* tool-call refactoring: moved common_chat_* to chat.h; common_chat_templates_init returns a unique_ptr to an opaque type
* addressed clang-tidy lints in [test-]chat.*
* rm minja deps from util & common & move it to common/minja/
* add name & tool_call_id to common_chat_msg
* add common_chat_tool
* added json <-> tools, msgs conversions to chat.h
* fix double bos/eos jinja avoidance hack (was preventing inner bos/eos tokens)
* fix deepseek r1 slow test (no longer <think> opening w/ new template)
* allow empty tools w/ auto + grammar
* fix & test server grammar & json_schema params w/ & w/o --jinja
|
2025-02-18 18:03:23 +00:00 |
|