Lukas Straub
|
a9f77a8be3
|
server : add openai-style logit_bias support (#14946)
Signed-off-by: Lukas Straub <lukasstraub2@web.de>
|
2025-07-31 14:08:23 +02:00 |
|
Olivier Chafik
|
f13847cfb5
|
server: fix regression on streamed non-chat completion w/ stops (#13785)
* more forgiving message diffs: partial stop words aren't erased, full stops are
* Add (slow) server test for completion + stream + stop
|
2025-05-26 14:16:37 +01:00 |
|
Xuan-Son Nguyen
|
360a9c98e1
|
server : fix cache_tokens bug with no cache_prompt (#13533)
|
2025-05-14 13:35:07 +02:00 |
|
Diego Devesa
|
1d36b3670b
|
llama : move end-user examples to tools directory (#13249)
* llama : move end-user examples to tools directory
---------
Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
|
2025-05-02 20:27:13 +02:00 |
|