mirror of https://github.com/wassname/ray.git synced 2026-07-01 03:59:39 +08:00

Files

T

Jones Wong 319c1340cb [rllib] Develop MARWIL (#3635 )

*  add marvil policy graph

*  fix typo

*  add offline optimizer and enable running marwil

*  fix loss function

*  add maintaining the moving average of advantage norm

*  use sync replay optimizer for unifying

*  remove offline optimizer and use sync replay optimizer

*  format by yapf

*  add imitation learning objective

*  fix according to eric's review

*  format by yapf

* revise

* add test data

* marwil

2019-01-16 19:00:43 -08:00

agents

[rllib] Develop MARWIL (#3635 )

2019-01-16 19:00:43 -08:00

contrib

[rllib] Add requested clarifications to test requirement of contrib docs (#3589 )

2018-12-21 11:02:02 -08:00

env

[rllib] Q-Mix implementation (Q-Mix, VDN, IQN, and Ape-X variants) (#3548 )

2018-12-18 10:40:01 -08:00

evaluation

Change timeout from milliseconds to seconds in ray.wait. (#3706 )

2019-01-08 21:32:08 -08:00

examples

[rllib] Documentation for I/O API and multi-agent support / cleanup (#3650 )

2019-01-03 15:15:36 +08:00

models

[rllib] Refactor pytorch custom model support (#3634 )

2019-01-03 13:48:33 +08:00

offline

[rllib] Documentation for I/O API and multi-agent support / cleanup (#3650 )

2019-01-03 15:15:36 +08:00

optimizers

[rllib] Develop MARWIL (#3635 )

2019-01-16 19:00:43 -08:00

test

[rllib] Develop MARWIL (#3635 )

2019-01-16 19:00:43 -08:00

tuned_examples

[rllib] Develop MARWIL (#3635 )

2019-01-16 19:00:43 -08:00

utils

Change timeout from milliseconds to seconds in ray.wait. (#3706 )

2019-01-08 21:32:08 -08:00

__init__.py

[rllib] [rfc] add contrib module and guideline for merging (#3565 )

2018-12-20 10:44:34 -08:00

asv.conf.json

[rllib][asv] Support ASV for RLlib (#2304 )

2018-06-28 17:20:09 -07:00

README.md

[rllib] Fix stats collection and some docs bugs since the refactoring (#2361 )

2018-07-07 13:29:20 -07:00

rollout.py

[rllib] fix for rollout of lstm policies (#3643 )

2019-01-13 15:54:23 -08:00

scripts.py

[docs] Switch docs to use rllib train instead of train.py

2018-12-04 17:36:06 -08:00

setup-rllib-dev.py

[rllib] Allow development without needing to compile Ray (#3623 )

2018-12-24 18:08:23 +09:00

train.py

Remove num_local_schedulers argument from ray.worker._init. (#3704 )

2019-01-07 12:44:49 -08:00

README.md

RLlib: Scalable Reinforcement Learning

RLlib is an open-source library for reinforcement learning that offers both a collection of reference algorithms and scalable primitives for composing new ones.

For an overview of RLlib, see the documentation.

If you've found RLlib useful for your research, you can cite the paper as follows:

@inproceedings{liang2018rllib,
    Author = {Eric Liang and
              Richard Liaw and
              Robert Nishihara and
              Philipp Moritz and
              Roy Fox and
              Ken Goldberg and
              Joseph E. Gonzalez and
              Michael I. Jordan and
              Ion Stoica},
    Title = {{RLlib}: Abstractions for Distributed Reinforcement Learning},
    Booktitle = {International Conference on Machine Learning ({ICML})},
    Year = {2018}
}