Logo
Explore Help
Register Sign In
wassname/vllm
1
0
Fork 0
You've already forked vllm
mirror of https://github.com/wassname/vllm.git synced 2026-06-30 11:46:07 +08:00
Code Issues Packages Projects Releases Wiki Activity
2,721 Commits 1 Branch 0 Tags
b4e4eda92e1d3a013fc4007db64b69d8604264ff
Commit Graph

2 Commits

Author SHA1 Message Date
afeldman-nm fd95e026e0 [Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942)
Co-authored-by: Andrew Feldman <afeld2012@gmail.com>
Co-authored-by: Nick Hill <nickhill@us.ibm.com>
2024-08-06 16:51:47 -04:00
afeldman-nm 543aa48573 [Kernel] Correctly invoke prefill & decode kernels for cross-attention (towards eventual encoder/decoder model support) (#4888)
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2024-07-08 17:12:15 +00:00
Powered by Gitea Version: 1.26.4 Page: 181ms Template: 1ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API