This website requires JavaScript.
Explore
Help
Register
Sign In
wassname
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://github.com/wassname/vllm.git
synced
2026-07-01 22:59:33 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
17
Commits
1
Branch
0
Tags
bb59a3e7302ad6892e097eee4040e3f516e9f4ea
T
Code
Clone
HTTPS
Tea CLI
Open with VS Code
Open with VSCodium
Open with Intellij IDEA
Download ZIP
Download TAR.GZ
Download BUNDLE
Woosuk Kwon
bb59a3e730
Fix cache engine
2023-02-13 09:35:48 +00:00
cacheflow
Fix cache engine
2023-02-13 09:35:48 +00:00
.gitignore
Add gitignore
2023-02-09 11:28:12 +00:00
README.md
Initial commit
2023-02-09 11:24:15 +00:00
setup.py
Add blank setup file
2023-02-09 11:37:06 +00:00
README.md
CacheFlow
Reference in New Issue
View Git Blame
Copy Permalink
S
Description
A high-throughput and memory-efficient inference and serving engine for LLMs
Readme
Apache-2.0
40
MiB
Languages
Python
85%
Cuda
10.2%
C++
3.1%
C
0.6%
Shell
0.6%
Other
0.4%