vllm/tests/models at d7740ea4dcee4ab75d7d6eef723f33cae957b288 - vllm

wassname/vllm

mirror of https://github.com/wassname/vllm.git synced 2026-07-05 04:32:37 +08:00

Files

T

History

alexm-nm 7038e8b803 [Kernel] Support running GPTQ 8-bit models in Marlin (#4533 )

2024-05-02 12:56:22 -04:00

test_aqlm.py

AQLM CUDA support (#3287 )

2024-04-23 13:59:33 -04:00

test_big_models.py

2024-04-30 21:18:14 -07:00

test_fp8.py

2024-04-30 21:46:12 +00:00

test_gptq_marlin.py

2024-05-02 12:56:22 -04:00

test_llava.py

2024-03-28 21:06:40 -07:00

test_marlin.py

2024-04-29 09:35:34 -07:00

test_mistral.py

2024-03-28 21:06:40 -07:00

test_models.py

2024-04-30 21:18:14 -07:00

test_oot_registration.py

2024-04-06 17:11:41 -07:00

utils.py

2024-04-29 09:35:34 -07:00