mirror of https://github.com/wassname/ray.git synced 2026-06-28 14:48:54 +08:00

Files

T

Kristian Hartikainen 13fb9fe3db [rllib] Feature/soft actor critic v2 (#5328 )

* Add base for Soft Actor-Critic

* Pick changes from old SAC branch

* Update sac.py

* First implementation of sac model

* Remove unnecessary SAC imports

* Prune unnecessary noise and exploration code

* Implement SAC model and use that in SAC policy

* runs but doesn't learn

* clear state

* fix batch size

* Add missing alpha grads and vars

* -200 by 2k timesteps

* doc

* lazy squash

* one file

* ignore tfp

* revert done

2019-08-01 23:37:36 -07:00

agents

[rllib] Feature/soft actor critic v2 (#5328 )

2019-08-01 23:37:36 -07:00

contrib

[rllib] Rename Agent to Trainer (#4556 )

2019-04-07 00:36:18 -07:00

env

[rllib] Rename PolicyEvaluator => RolloutWorker (#4820 )

2019-06-03 06:49:24 +08:00

evaluation

[rllib] Feature/soft actor critic v2 (#5328 )

2019-08-01 23:37:36 -07:00

examples

[rllib] Add rock paper scissors multi-agent example (#5336 )

2019-08-01 13:03:59 -07:00

models

[rllib] Add rock paper scissors multi-agent example (#5336 )

2019-08-01 13:03:59 -07:00

offline

[rllib] Rename PolicyEvaluator => RolloutWorker (#4820 )

2019-06-03 06:49:24 +08:00

optimizers

[rllib] Importance Sampling and KL Loss for APPO (#5051 )

2019-07-29 15:02:32 -07:00

policy

[rllib] Importance Sampling and KL Loss for APPO (#5051 )

2019-07-29 15:02:32 -07:00

tests

[rllib] Feature/soft actor critic v2 (#5328 )

2019-08-01 23:37:36 -07:00

tuned_examples

[rllib] Feature/soft actor critic v2 (#5328 )

2019-08-01 23:37:36 -07:00

utils

[rllib] Feature/soft actor critic v2 (#5328 )

2019-08-01 23:37:36 -07:00

__init__.py

[rllib] Rename PolicyEvaluator => RolloutWorker (#4820 )

2019-06-03 06:49:24 +08:00

asv.conf.json

[rllib][asv] Support ASV for RLlib (#2304 )

2018-06-28 17:20:09 -07:00

keras_policy.py

[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ (#4819 )

2019-05-20 16:46:05 -07:00

README.md

[rllib] Report sampler performance metrics (#4427 )

2019-03-27 13:24:23 -07:00

rollout.py

Fix Tuple spaces in rollout.py (#5332 )

2019-07-31 11:38:49 -07:00

scripts.py

[docs] Switch docs to use rllib train instead of train.py

2018-12-04 17:36:06 -08:00

train.py

[sgd] Replaced class Resources in sgd with use_gpu (#5252 )

2019-08-01 01:03:10 -07:00

README.md

RLlib: Scalable Reinforcement Learning

RLlib is an open-source library for reinforcement learning that offers both high scalability and a unified API for a variety of applications.

For an overview of RLlib, see the documentation.

If you've found RLlib useful for your research, you can cite the paper as follows:

@inproceedings{liang2018rllib,
    Author = {Eric Liang and
              Richard Liaw and
              Robert Nishihara and
              Philipp Moritz and
              Roy Fox and
              Ken Goldberg and
              Joseph E. Gonzalez and
              Michael I. Jordan and
              Ion Stoica},
    Title = {{RLlib}: Abstractions for Distributed Reinforcement Learning},
    Booktitle = {International Conference on Machine Learning ({ICML})},
    Year = {2018}
}