[Doc] Update description of vLLM support for CPUs (#6003)

2026-06-27 17:32:55 +08:00 · 2024-07-11 12:15:29 +08:00
parent 99ded1e1c4
commit 439c84581a
2 changed files with 2 additions and 2 deletions
@@ -59,7 +59,7 @@ vLLM is flexible and easy to use with:
 - Tensor parallelism support for distributed inference
 - Streaming outputs
 - OpenAI-compatible API server
- Support NVIDIA GPUs, AMD GPUs, Intel CPUs and GPUs
+- Support NVIDIA GPUs, AMD CPUs and GPUs, Intel CPUs and GPUs, PowerPC CPUs
 - (Experimental) Prefix caching support
 - (Experimental) Multi-lora support