Expose generation timings from server & update completions.js (#2116)

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-13 11:57:43 -04:00

* use javascript generators as much cleaner API

Also add ways to access completion as promise and EventSource

* export llama_timings as struct and expose them in server

* update readme, update baked includes

* llama : uniform variable names + struct init

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

This commit is contained in:

Tobias Lütke

2023-07-05 16:51:13 -04:00

committed by

GitHub

parent 983b555e9d

commit 31cfbb1013

9 changed files with 1921 additions and 1363 deletions

1581

examples/server/index.html.hpp

View File

File diff suppressed because it is too large Load Diff

Expose generation timings from server & update completions.js (#2116)

1581 examples/server/index.html.hpp View File

1581

examples/server/index.html.hpp

View File