[docs] Distributed Training Quickfix (#5571)

This commit is contained in:
Richard Liaw
2019-08-29 15:38:43 -07:00
committed by GitHub
parent 04b869678e
commit fb40787603
+2 -2
View File
@@ -38,7 +38,7 @@ Then, start a Ray cluster `via autoscaler <autoscaling.html>`_ or `manually <usi
In your training program, insert the following:
.. code-block::
.. code-block:: python
torch.distributed.init_process_group(backend='YOUR BACKEND',
init_method='env://')
@@ -49,7 +49,7 @@ In your training program, insert the following:
Then, separately, on each machine:
.. code-block::
.. code-block:: bash
# Node 1: *(IP: 192.168.1.1, and has a free port: 1234)*
$ python -m torch.distributed.launch --nproc_per_node=NUM_GPUS_YOU_HAVE