Mirror of https://github.com/ggml-org/llama.cpp.git
ggml : add RPC backend (#6829)
The RPC backend proxies all operations to a remote server which runs a regular backend (CPU, CUDA, Metal, etc.).

Squashed commits:

* ggml : add RPC backend
* set TCP_NODELAY
* add CI workflows
* Address review comments
* fix warning
* implement llama_max_devices() for RPC
* Address review comments
* Address review comments
* wrap sockfd into a struct
* implement get_alignment and get_max_size
* add get_device_memory
* fix warning
* win32 support
* add README
* readme : trim trailing whitespace
* Address review comments
* win32 fix
* Address review comments
* fix compile warnings on macos
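As a rough illustration of what "proxying all operations to a remote server" means on the client side, here is a minimal, hypothetical C++ sketch. It assumes the header name ggml-rpc.h, an endpoint string of the form "host:port", and the functions ggml_backend_rpc_init() and ggml_backend_rpc_get_device_memory(); none of these are spelled out in the commit message above, so treat it as a sketch of the idea rather than the exact API introduced by this commit.

// Hypothetical client-side use of the RPC backend (names assumed, not verified
// against this commit). The remote host is expected to run the rpc-server example.
#include "ggml-rpc.h"
#include <cstdio>

int main() {
    const char * endpoint = "192.168.1.10:50052"; // hypothetical remote rpc-server

    // Ask the remote backend how much device memory it reports (assumed signature).
    size_t free_mem = 0, total_mem = 0;
    ggml_backend_rpc_get_device_memory(endpoint, &free_mem, &total_mem);
    printf("remote memory: %zu free / %zu total\n", free_mem, total_mem);

    // Create a backend handle; operations issued through it are serialized over
    // TCP and executed by the regular backend (CPU, CUDA, Metal, ...) on the server.
    ggml_backend_t backend = ggml_backend_rpc_init(endpoint);
    if (!backend) {
        fprintf(stderr, "failed to connect to %s\n", endpoint);
        return 1;
    }

    // ... build a ggml graph and run it, e.g. via ggml_backend_graph_compute(backend, graph) ...

    ggml_backend_free(backend);
    return 0;
}

Because every operation becomes a small request/response over a TCP socket, setting TCP_NODELAY (one of the follow-up commits above) avoids Nagle-induced latency on those messages.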
Commit: 5e31828d3e
Parent: 541600201e
Committed by: GitHub
examples/rpc/CMakeLists.txt (new file, 2 additions)
@@ -0,0 +1,2 @@
+add_executable(rpc-server rpc-server.cpp)
+target_link_libraries(rpc-server PRIVATE ggml llama)