mirror of
https://github.com/wassname/ray.git
synced 2026-06-28 13:19:38 +08:00
9ea57c2a93
Rename AsyncSamplesOptimizer -> AsyncReplayOptimizer Add AsyncSamplesOptimizer that implements the IMPALA architecture integrate V-trace with a3c policy graph audit V-trace integration benchmark compare vs A3C and with V-trace on/off PongNoFrameskip-v4 on IMPALA scaling from 16 to 128 workers, solving Pong in <10 min. For reference, solving this env takes ~40 minutes for Ape-X and several hours for A3C.
Ray Documentation
To compile the documentation, run the following commands from this directory. Note that Ray must be installed first.
pip install -r requirements-doc.txt
make html
open _build/html/index.html
To test if there are any build errors with the documentation, do the following.
sphinx-build -W -b html -d _build/doctrees source _build/html