This website requires JavaScript.
Explore
Help
Register
Sign In
wassname
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://github.com/wassname/vllm.git
synced
2026-07-05 04:49:43 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
67d745cc68d9ad31bf683a88f00a1aee9782f541
vllm
/
vllm
/
distributed
/
device_communicators
T
History
Woosuk Kwon
fad5576c58
[TPU] Reduce compilation time & Upgrade PyTorch XLA version (
#6856
)
2024-07-27 10:28:33 -07:00
..
__init__.py
[Core][Refactor] move parallel_utils into vllm/distributed (
#3950
)
2024-04-10 15:33:30 -07:00
cuda_wrapper.py
[distributed][misc] be consistent with pytorch for libcudart.so (
#6346
)
2024-07-11 19:35:17 -07:00
custom_all_reduce_utils.py
[CI/Build] vLLM cache directory for images (
#6444
)
2024-07-15 23:12:25 -07:00
custom_all_reduce.py
[core][distributed] zmq fallback for broadcasting large objects (
#6183
)
2024-07-09 18:49:11 -07:00
pynccl_wrapper.py
[mypy] Enable type checking for test directory (
#5017
)
2024-06-15 04:45:31 +00:00
pynccl.py
[Distributed] Add send and recv helpers (
#5719
)
2024-06-23 14:42:28 -07:00
shm_broadcast.py
[core][distributed] fix zmq hang (
#6759
)
2024-07-24 17:37:12 -07:00
tpu_communicator.py
[TPU] Reduce compilation time & Upgrade PyTorch XLA version (
#6856
)
2024-07-27 10:28:33 -07:00