ray/python/ray/rllib at d47d6a6b7aa5ab195fd9b01b9da8880e52e07aea

wassname/ray

mirror of https://github.com/wassname/ray.git synced 2026-06-28 22:20:52 +08:00

Alok Singh d47d6a6b7a [rllib] Use correct method name (#2226)

2018-06-11 09:53:31 -07:00

__init__.py   [rllib] Merge DDPG and DDPG2 implementations (#2202)   2018-06-09 16:46:23 -07:00
agent.py      [rllib] Merge DDPG and DDPG2 implementations (#2202)   2018-06-09 16:46:23 -07:00
README.rst    [rllib] Merge DDPG and DDPG2 implementations (#2202)   2018-06-09 16:46:23 -07:00
rollout.py    [rllib] Rollout script needs to pipe in config and update states (#1566)   2018-02-20 12:04:41 -08:00
train.py      [tune] [rllib] Automatically determine RLlib resources and add queueing mechanism for autoscaling (#1848)   2018-04-16 16:58:15 -07:00

README.rst

Ray RLlib: Scalable Reinforcement Learning
==========================================

Ray RLlib is an RL execution toolkit built on the Ray distributed execution framework. See the `user documentation <http://ray.readthedocs.io/en/latest/rllib.html>`__ and `paper <https://arxiv.org/abs/1712.09381>`__.

RLlib includes the following reference algorithms:

- Proximal Policy Optimization (`PPO <https://github.com/ray-project/ray/tree/master/python/ray/rllib/ppo>`__), a proximal variant of `TRPO <https://arxiv.org/abs/1502.05477>`__.

- Policy Gradients (`PG <https://github.com/ray-project/ray/tree/master/python/ray/rllib/pg>`__).

- Asynchronous Advantage Actor-Critic (`A3C <https://github.com/ray-project/ray/tree/master/python/ray/rllib/a3c>`__).

- Deep Q Networks (`DQN <https://github.com/ray-project/ray/tree/master/python/ray/rllib/dqn>`__).

- Deep Deterministic Policy Gradients (`DDPG <https://github.com/ray-project/ray/tree/master/python/ray/rllib/ddpg>`__).

- Ape-X Distributed Prioritized Experience Replay, including both `DQN <https://github.com/ray-project/ray/blob/master/python/ray/rllib/dqn/apex.py>`__ and `DDPG <https://github.com/ray-project/ray/blob/master/python/ray/rllib/ddpg/apex.py>`__ variants.

- Evolution Strategies (`ES <https://github.com/ray-project/ray/tree/master/python/ray/rllib/es>`__), as described in `this paper <https://arxiv.org/abs/1703.03864>`__.

These algorithms can be run on any OpenAI Gym MDP, including custom ones written and registered by the user.
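
As a rough illustration of what a user-written MDP might look like, the sketch below implements a toy environment with the classic Gym-style ``reset()`` / ``step()`` interface in plain Python. The names ``CorridorEnv`` and ``run_episode`` are hypothetical and do not come from RLlib. The class deliberately does not subclass ``gym.Env`` or define ``observation_space`` and ``action_space``, which a real Gym environment would need. Registering the environment with RLlib is left out because the mechanism depends on the installed version.

```python
class CorridorEnv:
    """Toy MDP with a Gym-style interface (hypothetical example).

    The agent starts at position 0 and must reach position `length - 1`.
    Actions: 0 = move left, 1 = move right.
    """

    def __init__(self, length=5):
        self.length = length
        self.position = 0

    def reset(self):
        # Start a new episode and return the initial observation.
        self.position = 0
        return self.position

    def step(self, action):
        # Apply the action and return (observation, reward, done, info).
        if action not in (0, 1):
            raise ValueError("action must be 0 (left) or 1 (right)")
        if action == 1:
            self.position = min(self.position + 1, self.length - 1)
        else:
            self.position = max(self.position - 1, 0)
        done = self.position == self.length - 1
        # Small per-step penalty, +1 on reaching the goal.
        reward = 1.0 if done else -0.1
        return self.position, reward, done, {}


def run_episode(env, policy, max_steps=100):
    """Roll out one episode with `policy(obs) -> action`; return total reward."""
    obs = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        obs, reward, done, _ = env.step(policy(obs))
        total_reward += reward
        if done:
            break
    return total_reward


if __name__ == "__main__":
    # A fixed "always move right" policy reaches the goal in length - 1 steps.
    print(run_episode(CorridorEnv(length=5), lambda obs: 1))
```

A learning algorithm would stand in for the fixed policy used here. The rollout loop only shows the contract the environment has to satisfy.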