llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-15 20:53:00 -04:00

Files

Pierrick Hymbert d52d7819b8 server: concurrency fix + monitoring - add /metrics prometheus compatible endpoint (#5708 )

* server: monitoring - add /metrics prometheus compatible endpoint

* server: concurrency issue, when 2 task are waiting for results, only one call thread is notified

* server: metrics - move to a dedicated struct

2024-02-25 13:49:43 +01:00

steps.py

server: concurrency fix + monitoring - add /metrics prometheus compatible endpoint (#5708 )

2024-02-25 13:49:43 +01:00