vllm/cacheflow/parallel_utils/tensor_parallel at 4858f3bb45ec62fab1fc32dc26eb1e2a8e1df14b - vllm - Gitea: Git with a cup of tea

wassname/vllm

mirror of https://github.com/wassname/vllm.git synced 2026-06-27 22:54:36 +08:00

Files

T

History

Woosuk Kwon 12659a0bd7 Add CUDA graph-based all reduce launcher (#26 )

2023-04-05 11:16:57 -07:00

..

__init__.py

Optimize tensor parallel execution speed (#17 )

2023-04-01 00:51:08 +08:00

layers.py

Add CUDA graph-based all reduce launcher (#26 )

2023-04-05 11:16:57 -07:00

mappings.py

Support tensor parallel (#2 )

2023-03-21 13:45:42 -07:00

random.py

Support tensor parallel (#2 )

2023-03-21 13:45:42 -07:00

utils.py

Support tensor parallel (#2 )

2023-03-21 13:45:42 -07:00