vllm/tests/basic_correctness/test_cpu_offload.py at 235366fe2eb3144321978e181af94487f0215595 - vllm - Gitea: Git with a cup of tea

wassname/vllm

mirror of https://github.com/wassname/vllm.git synced 2026-06-28 03:52:15 +08:00

Files

T

Wallas Henrique c0292211ce [CI/Build] Replaced some models on tests for smaller ones (#9570 )

Signed-off-by: Wallas Santos <wallashss@ibm.com>

2024-10-22 04:52:14 +00:00

7 lines

175 B

Python

Raw Blame History

 from ..utils import compare_two_settings
 def test_cpu_offload():
     compare_two_settings("meta-llama/Llama-3.2-1B", [],
                          ["--cpu-offload-gb", "1"])