vllm/tests/spec_decode/e2e at edd5fe5fa29b8f9cc5fa37a30cc7211e0ff37067 - vllm - Gitea: Git with a cup of tea

wassname/vllm

mirror of https://github.com/wassname/vllm.git synced 2026-07-02 21:23:42 +08:00

Files

T

History

zifeitong 78687504f7 [Bugfix] AsyncLLMEngine hangs with asyncio.run (#5654 )

2024-06-19 13:57:12 -07:00

..

__init__.py

[Speculative decoding 7/9] Speculative decoding end-to-end correctness tests. (#3951 )

2024-04-23 08:02:36 +00:00

conftest.py

[Bugfix] AsyncLLMEngine hangs with asyncio.run (#5654 )

2024-06-19 13:57:12 -07:00

test_compatibility.py

[Speculative decoding][Re-take] Enable TP>1 speculative decoding (#4840 )

2024-05-16 00:53:51 -07:00

test_integration_dist.py

[Speculative decoding][Re-take] Enable TP>1 speculative decoding (#4840 )

2024-05-16 00:53:51 -07:00

test_integration.py

[Speculative decoding][Re-take] Enable TP>1 speculative decoding (#4840 )

2024-05-16 00:53:51 -07:00

test_logprobs.py

[Speculative decoding] Support target-model logprobs (#4378 )

2024-05-03 15:52:01 -07:00

test_multistep_correctness.py

[Speculative decoding][Re-take] Enable TP>1 speculative decoding (#4840 )

2024-05-16 00:53:51 -07:00

test_ngram_correctness.py

[Dynamic Spec Decoding] Minor fix for disabling speculative decoding (#5000 )

2024-05-25 10:00:14 -07:00