vllm/tests/weight_loading at fdd9daafa3b31746ec8ec7c0d67ebc7efeb13f8f - vllm

mirror of https://github.com/wassname/vllm.git synced 2026-07-01 11:11:30 +08:00

Files

T

Dipika Sikka fc911880cc [Kernel] Expand MoE weight loading + Add Fused Marlin MoE Kernel (#7766 )

Co-authored-by: ElizaWszola <eliza@neuralmagic.com>

2024-08-27 15:07:09 -07:00

models.txt

2024-08-27 15:07:09 -07:00

run_model_weight_loading_test.sh

2024-08-13 14:30:11 -04:00

test_weight_loading.py

2024-08-13 14:30:11 -04:00