This website requires JavaScript.
Explore
Help
Register
Sign In
wassname
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://github.com/wassname/vllm.git
synced
2026-07-03 14:54:46 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
18
Commits
1
Branch
0
Tags
fffa2e1f4b7534d5f86e900838d9a24dfba307c9
T
Code
Clone
HTTPS
Tea CLI
Open with VS Code
Open with VSCodium
Open with Intellij IDEA
Download ZIP
Download TAR.GZ
Download BUNDLE
Woosuk Kwon
fffa2e1f4b
Add model_utils
2023-02-13 09:36:12 +00:00
cacheflow
Add model_utils
2023-02-13 09:36:12 +00:00
.gitignore
Add gitignore
2023-02-09 11:28:12 +00:00
README.md
Initial commit
2023-02-09 11:24:15 +00:00
setup.py
Add blank setup file
2023-02-09 11:37:06 +00:00
README.md
CacheFlow
Reference in New Issue
View Git Blame
Copy Permalink
S
Description
A high-throughput and memory-efficient inference and serving engine for LLMs
Readme
Apache-2.0
40
MiB
Languages
Python
85%
Cuda
10.2%
C++
3.1%
C
0.6%
Shell
0.6%
Other
0.4%