ray/python/ray/rllib at 3bf80839cb06d2e0e3779f242cf4b4e12e70fa02 - ray - Gitea: Git with a cup of tea

wassname/ray

mirror of https://github.com/wassname/ray.git synced 2026-06-29 13:15:35 +08:00

Files

T

History

Eric Liang faaa123046 [rllib] Set num_cpu=None for workers in the default settings (#1793 )

2018-03-29 16:33:40 -07:00

..

[rllib] [docs] Cleanup RLlib API and make docs consistent with upcoming blog post (#1708 )

2018-03-15 15:57:31 -07:00

[rllib] [docs] Cleanup RLlib API and make docs consistent with upcoming blog post (#1708 )

2018-03-15 15:57:31 -07:00

[rllib] Set num_cpu=None for workers in the default settings (#1793 )

2018-03-29 16:33:40 -07:00

Remove from X import Y convention in RLlib ES. (#1774 )

2018-03-23 05:54:31 -07:00

[rllib] Upgrade to OpenAI Gym 0.10.3 (#1601 )

2018-03-06 00:31:02 -08:00

[rllib] Upgrade to OpenAI Gym 0.10.3 (#1601 )

2018-03-06 00:31:02 -08:00

[rllib] Set num_cpu=None for workers in the default settings (#1793 )

2018-03-29 16:33:40 -07:00

[rllib] [docs] Cleanup RLlib API and make docs consistent with upcoming blog post (#1708 )

2018-03-15 15:57:31 -07:00

[rllib] Set num_cpu=None for workers in the default settings (#1793 )

2018-03-29 16:33:40 -07:00

[rllib] [docs] Cleanup RLlib API and make docs consistent with upcoming blog post (#1708 )

2018-03-15 15:57:31 -07:00

[tune] Change tune resource request syntax to be less confusing (#1764 )

2018-03-23 06:25:01 -07:00

[rllib] [docs] Cleanup RLlib API and make docs consistent with upcoming blog post (#1708 )

2018-03-15 15:57:31 -07:00

__init__.py

[rllib] [docs] Cleanup RLlib API and make docs consistent with upcoming blog post (#1708 )

2018-03-15 15:57:31 -07:00

agent.py

Remove from X import Y convention in RLlib ES. (#1774 )

2018-03-23 05:54:31 -07:00

README.rst

[rllib] remove redundant docs (#1728 )

2018-03-17 14:45:04 -07:00

rollout.py

[rllib] Rollout script needs to pipe in config and update states (#1566 )

2018-02-20 12:04:41 -08:00

train.py

[tune] Change tune resource request syntax to be less confusing (#1764 )

2018-03-23 06:25:01 -07:00

README.rst

Ray RLlib: Scalable Reinforcement Learning
==========================================

Ray RLlib is an RL execution toolkit built on the Ray distributed execution framework. See the `user documentation <http://ray.readthedocs.io/en/latest/rllib.html>`__ and `paper <https://arxiv.org/abs/1712.09381>`__.

RLlib includes the following reference algorithms:

-  `Proximal Policy Optimization (PPO) <https://arxiv.org/abs/1707.06347>`__ which
   is a proximal variant of `TRPO <https://arxiv.org/abs/1502.05477>`__.

-  `The Asynchronous Advantage Actor-Critic (A3C) <https://arxiv.org/abs/1602.01783>`__.

- `Deep Q Networks (DQN) <https://arxiv.org/abs/1312.5602>`__.

- `Ape-X Distributed Prioritized Experience Replay <https://arxiv.org/abs/1803.00933>`__.

-  Evolution Strategies, as described in `this
   paper <https://arxiv.org/abs/1703.03864>`__. Our implementation
   is adapted from
   `here <https://github.com/openai/evolution-strategies-starter>`__.

These algorithms can be run on any OpenAI Gym MDP, including custom ones written and registered by the user.