Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
(deployment-kserve)=
# KServe

vLLM can be deployed with KServe on Kubernetes for highly scalable distributed model serving.
See the KServe documentation for more details on using vLLM with KServe.
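
As a rough sketch of what such a deployment can look like, the Python snippet below builds a KServe `InferenceService` manifest and prints it as JSON. It is not taken from the guide. It assumes the cluster has KServe's Hugging Face serving runtime installed, that this runtime is configured to use vLLM as its backend, and that it accepts the `--model_name` and `--model_id` arguments. The helper `build_inference_service`, the service name `opt-125m`, and the model `facebook/opt-125m` are illustrative choices only.

```python
import json


def build_inference_service(name: str, model_id: str, gpus: int = 1) -> dict:
    """Return a KServe InferenceService manifest as a plain dict.

    Hypothetical helper for illustration. The field layout follows the
    serving.kserve.io/v1beta1 API; the runtime arguments are assumptions
    about KServe's Hugging Face runtime, not vLLM options.
    """
    return {
        "apiVersion": "serving.kserve.io/v1beta1",
        "kind": "InferenceService",
        "metadata": {"name": name},
        "spec": {
            "predictor": {
                "model": {
                    # Selects the Hugging Face serving runtime (assumed installed).
                    "modelFormat": {"name": "huggingface"},
                    "args": [
                        f"--model_name={name}",
                        f"--model_id={model_id}",
                    ],
                    # Kubernetes resource quantities are passed as strings here.
                    "resources": {
                        "limits": {"nvidia.com/gpu": str(gpus)},
                    },
                }
            }
        },
    }


if __name__ == "__main__":
    manifest = build_inference_service("opt-125m", "facebook/opt-125m")
    print(json.dumps(manifest, indent=2))
```

Since `kubectl` accepts JSON as well as YAML, the printed manifest could be saved to a file and applied with `kubectl apply -f <file>`. CPU and memory limits are left out here and would need to be sized for the chosen model.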