vllm/tests/worker at 67d745cc68d9ad31bf683a88f00a1aee9782f541 - vllm - Gitea: Git with a cup of tea

wassname/vllm

mirror of https://github.com/wassname/vllm.git synced 2026-06-30 21:47:56 +08:00

Files

T

History

Cody Yu 309aaef825 [Bugfix] Fix decode tokens w. CUDA graph (#6757 )

2024-07-24 22:33:56 -07:00

..

__init__.py

[Speculative decoding 2/9] Multi-step worker for draft model (#2424 )

2024-01-21 16:31:47 -08:00

test_model_input.py

[Core] Refactor _prepare_model_input_tensors - take 2 (#6164 )

2024-07-17 09:37:16 -07:00

test_model_runner.py

[Bugfix] Fix decode tokens w. CUDA graph (#6757 )

2024-07-24 22:33:56 -07:00

test_swap.py

[Core] Pipeline Parallel Support (#4412 )

2024-07-02 10:58:08 -07:00