server : match OAI structured output response (#9527)

2025-06-26 19:55:04 +00:00 · 2024-09-18 01:50:34 -05:00
parent f799155ab8
commit 8a308354f6
3 changed files with 5 additions and 2 deletions
--- a/grammars/README.md
+++ b/grammars/README.md
@ -120,7 +120,7 @@ You can use GBNF grammars:

 - In [llama-server](../examples/server):
    - For any completion endpoints, passed as the `json_schema` body field
-    - For the `/chat/completions` endpoint, passed inside the `response_format` body field (e.g. `{"type", "json_object", "schema": {"items": {}}}`)
+    - For the `/chat/completions` endpoint, passed inside the `response_format` body field (e.g. `{"type", "json_object", "schema": {"items": {}}}` or `{ type: "json_schema", json_schema: {"schema": ...} }`)
 - In [llama-cli](../examples/main), passed as the `--json` / `-j` flag
 - To convert to a grammar ahead of time:
    - in CLI, with [examples/json_schema_to_grammar.py](../examples/json_schema_to_grammar.py)