This website requires JavaScript.
Explore
Help
Register
Sign In
wassname
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://github.com/wassname/vllm.git
synced
2026-06-27 19:49:51 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
8
Commits
1
Branch
0
Tags
e7bee2aa811963b8c5ce352a427595749f6bfca1
T
Code
Clone
HTTPS
Tea CLI
Open with VS Code
Open with VSCodium
Open with Intellij IDEA
Download ZIP
Download TAR.GZ
Download BUNDLE
Woosuk Kwon
e7bee2aa81
Add cache engine
2023-02-09 11:28:02 +00:00
cacheflow
Add cache engine
2023-02-09 11:28:02 +00:00
README.md
Initial commit
2023-02-09 11:24:15 +00:00
README.md
CacheFlow
Reference in New Issue
View Git Blame
Copy Permalink
S
Description
A high-throughput and memory-efficient inference and serving engine for LLMs
Readme
Apache-2.0
40
MiB
Languages
Python
85%
Cuda
10.2%
C++
3.1%
C
0.6%
Shell
0.6%
Other
0.4%