server: continuous performance monitoring and PR comment (#6283)

mirror of https://github.com/ggml-org/llama.cpp.git synced 2025-08-19 14:31:06 -04:00

* server: bench: init

* server: bench: reduce list of GPU nodes

* server: bench: fix graph, fix output artifact

* ci: bench: add mermaid in case of image cannot be uploaded

* ci: bench: more resilient, more metrics

* ci: bench: trigger build

* ci: bench: fix duration

* ci: bench: fix typo

* ci: bench: fix mermaid values, markdown generated

* typo on the step name

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

* ci: bench: trailing spaces

* ci: bench: move images in a details section

* ci: bench: reduce bullet point size

---------

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

This commit is contained in:

Pierrick Hymbert

2024-03-27 20:26:49 +01:00

committed by

GitHub

parent 53c7ec53d5

commit a016026a3a

5 changed files with 603 additions and 9 deletions

2

examples/server/bench/requirements.txt Normal file

View File

@@ -0,0 +1,2 @@
 matplotlib
 requests

server: continuous performance monitoring and PR comment (#6283)

2 examples/server/bench/requirements.txt Normal file Unescape Escape View File

2

examples/server/bench/requirements.txt Normal file

View File