mirror of
https://github.com/wassname/vllm.git
synced 2026-06-29 12:09:14 +08:00
32aa2059ad
Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>
329 B
329 B
(deploying-with-kserve)=
Deploying with KServe
vLLM can be deployed with KServe on Kubernetes for highly scalable distributed model serving.
Please see this guide for more details on using vLLM with KServe.