wassname/vllm
Mirror of https://github.com/wassname/vllm.git (synced 2026-06-28 01:47:27 +08:00)
2,381 commits, 1 branch, 0 tags
0df7ec0b2d890799ca71e2f862fdff5fcc52cdc0
Commit Graph
2 Commits
Author | SHA1 | Message | Date
SangBin Cho | ff7ec82c4d | [Core] Optimize SPMD architecture with delta + serialization optimization (#7109) | 2024-08-18 17:57:20 -07:00
afeldman-nm | fd95e026e0 | [Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942) ... Co-authored-by: Andrew Feldman <afeld2012@gmail.com> Co-authored-by: Nick Hill <nickhill@us.ibm.com> | 2024-08-06 16:51:47 -04:00