wassname/vllm
Mirror of https://github.com/wassname/vllm.git (synced 2026-06-29 09:57:17 +08:00)
38 commits, 1 branch, 0 tags
3b41f16596e9981dac8df85b4eff00311abcfec3
Latest commit: 3b41f16596 "Add gitignore" by Woosuk Kwon, 2023-02-16 07:47:21 +00:00
Name        Last commit          Date
cacheflow   Implement cache ops  2023-02-16 07:47:03 +00:00
csrc        Implement cache ops  2023-02-16 07:47:03 +00:00
.gitignore  Add gitignore        2023-02-16 07:47:21 +00:00
README.md   Initial commit       2023-02-09 11:24:15 +00:00
setup.py    Implement cache ops  2023-02-16 07:47:03 +00:00
README.md
CacheFlow
Description
A high-throughput and memory-efficient inference and serving engine for LLMs
License: Apache-2.0
Size: 40 MiB
Languages
Python  85%
Cuda    10.2%
C++     3.1%
C       0.6%
Shell   0.6%
Other   0.4%