Fixes a typo about 'max_decode_seq_len' which causes crashes with cuda graph. (#9285)

mirror of https://github.com/wassname/vllm.git synced 2026-06-27 19:01:53 +08:00

Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com>

This commit is contained in:

Tao He

2024-11-08 13:31:28 +08:00

committed by

GitHub

parent 3a7f15a398

commit da07a9ead7