mirror of
https://github.com/wassname/vllm.git
synced 2026-07-03 16:59:49 +08:00
195 B
195 B
CacheFlow
Installation
pip install psutil numpy torch transformers
pip install flash-attn # This may take up to 10 mins.
pip install -e .
Run
python server.py