vllm/tests/quantization at ee93f4f92acbd9759a9af80747bc2a4459f07639 - vllm

mirror of https://github.com/wassname/vllm.git synced 2026-07-04 23:22:18 +08:00

Files

T

Qubitium-ModelCloud ee93f4f92a [CORE] Quantized lm-head Framework (#4442 )

Co-authored-by: Robert Shaw <rshaw@neuralmagic.com>
Co-authored-by: ZX <zx@lbx.dev>

2024-07-02 22:25:17 +00:00

__init__.py

2024-05-13 23:50:09 +09:00

test_bitsandbytes.py

2024-06-13 15:18:08 +00:00

test_compressed_tensors.py

2024-06-30 23:06:27 +00:00

test_configs.py

2024-06-15 04:45:31 +00:00

test_fp8.py

2024-06-30 23:06:27 +00:00

test_lm_head.py

2024-07-02 22:25:17 +00:00

utils.py

2024-06-30 20:07:34 -07:00