vllm/tests/kernels at c9415c19d3df26d8ede611abefba35c6837cd934

wassname/vllm

mirror of https://github.com/wassname/vllm.git synced 2026-06-30 06:48:05 +08:00

Zhuohan Li 2f8844ba08 Re-enable the 80 char line width limit (#3305)

2024-03-10 19:49:14 -07:00

allclose_default.py

[ROCm] Fix some kernels failed unit tests (#2498)

2024-02-05 14:25:36 -08:00

conftest.py

Support FP8-E5M2 KV Cache (#2279)

2024-01-28 16:43:54 -08:00

test_activation.py

Optimize GeGLU layer in Gemma (#2975)

2024-02-21 20:17:52 -08:00

test_attention.py

[ROCm] Fix some kernels failed unit tests (#2498)

2024-02-05 14:25:36 -08:00

test_cache.py

[Minor] More fix of test_cache.py CI test failure (#2750)

2024-02-06 11:38:38 -08:00

test_layernorm.py

Remove hardcoded device="cuda" to support more devices (#2503)

2024-02-01 15:46:39 -08:00

test_moe.py

Re-enable the 80 char line width limit (#3305)

2024-03-10 19:49:14 -07:00

test_pos_encoding.py

[ROCm] Fix some kernels failed unit tests (#2498)

2024-02-05 14:25:36 -08:00

test_prefix_prefill.py

Re-enable the 80 char line width limit (#3305)

2024-03-10 19:49:14 -07:00