This website requires JavaScript.
Explore
Help
Register
Sign In
wassname
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://github.com/wassname/vllm.git
synced
2026-06-27 22:24:35 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
33
Commits
1
Branch
0
Tags
2f4887de77fc36ec27f6b2e8d4cd52c9cf02efed
T
Code
Clone
HTTPS
Tea CLI
Open with VS Code
Open with VSCodium
Open with Intellij IDEA
Download ZIP
Download TAR.GZ
Download BUNDLE
Woosuk Kwon
2f4887de77
Fix KVCache shape
2023-02-16 01:24:45 +00:00
cacheflow
Fix KVCache shape
2023-02-16 01:24:45 +00:00
.gitignore
Add gitignore
2023-02-09 11:28:12 +00:00
README.md
Initial commit
2023-02-09 11:24:15 +00:00
setup.py
Add blank setup file
2023-02-09 11:37:06 +00:00
README.md
CacheFlow
Reference in New Issue
View Git Blame
Copy Permalink
S
Description
A high-throughput and memory-efficient inference and serving engine for LLMs
Readme
Apache-2.0
40
MiB
Languages
Python
85%
Cuda
10.2%
C++
3.1%
C
0.6%
Shell
0.6%
Other
0.4%