vllm/tests/weight_loading at ab1091d5f2fc879ba9e62002f4d9eec013984d4d - vllm

mirror of https://github.com/wassname/vllm.git synced 2026-07-04 20:49:14 +08:00

Files

T

Michael Goin 5e5c8e091e [Quant][Perf] Use moe_wna16 kernel by default for MoEs with many experts (#13236 )

Signed-off-by: mgoin <mgoin64@gmail.com>

2025-02-14 12:53:42 -08:00

models-large.txt

2025-02-06 01:02:14 -08:00

models.txt

2025-01-30 23:49:37 -08:00

run_model_weight_loading_test.sh

2025-01-30 23:49:37 -08:00

test_weight_loading.py

2025-02-14 12:53:42 -08:00