vllm/vllm/distributed/device_communicators at 67d745cc68d9ad31bf683a88f00a1aee9782f541

wassname/vllm

mirror of https://github.com/wassname/vllm.git synced 2026-07-05 04:49:43 +08:00

Woosuk Kwon fad5576c58 [TPU] Reduce compilation time & Upgrade PyTorch XLA version (#6856)

2024-07-27 10:28:33 -07:00

__init__.py

[Core][Refactor] move parallel_utils into vllm/distributed (#3950)

2024-04-10 15:33:30 -07:00

cuda_wrapper.py

[distributed][misc] be consistent with pytorch for libcudart.so (#6346)

2024-07-11 19:35:17 -07:00

custom_all_reduce_utils.py

[CI/Build] vLLM cache directory for images (#6444)

2024-07-15 23:12:25 -07:00

custom_all_reduce.py

[core][distributed] zmq fallback for broadcasting large objects (#6183)

2024-07-09 18:49:11 -07:00

pynccl_wrapper.py

[mypy] Enable type checking for test directory (#5017)

2024-06-15 04:45:31 +00:00

pynccl.py

[Distributed] Add send and recv helpers (#5719)

2024-06-23 14:42:28 -07:00

shm_broadcast.py

[core][distributed] fix zmq hang (#6759)

2024-07-24 17:37:12 -07:00

tpu_communicator.py

[TPU] Reduce compilation time & Upgrade PyTorch XLA version (#6856)

2024-07-27 10:28:33 -07:00