8 Commits

Author SHA1 Message Date
Eric Liang 715737cc06 [docs] Add backlinks from hyperopt / rl algorithm examples to the built-on Ray libraries (#1356) 2017-12-23 00:31:33 -08:00
Eric Liang fbf1806b8a [tune] Clean up result logging: move out of /tmp, add timestamp (#1297) 2017-12-15 14:19:08 -08:00
Eric Liang 316f9e2bb7 [tune] Support user-defined trainable functions / classes / envs with a shared object registry (#1226) 2017-11-20 17:52:43 -08:00
Eric Liang 90013eda2d [rllib] Fix docs to reference new code locations (#1092)
* fix rllib docs

* Update example-a3c.rst
2017-10-09 22:58:58 -07:00
Eric Liang a674ec958c [rllib] Move policy gradient and evolution strategies algorithms from examples/ to ray/rllib/ (#694)
* rllib v0

* fix imports

* lint

* comments

* update docs
2017-06-25 22:13:03 +00:00
Eric Liang 06241daf61 Policy gradient example: record stats for tensorboard (#577)
* add tf metrics

* comments

* fix network scopes

* add doc

* use format string

* fix trace level

* plot intermediate and final sgd stats

* add back a global step
2017-05-21 14:51:24 -07:00
Philipp Moritz 4af0aa6258 Atari on pixels (#364)
* pong on pixels working (not cleaned up)

* make training compatible with all atari games

* cartpole runs

* Update documentation and usage for policy gradients.
2017-03-14 13:31:29 -07:00
Philipp Moritz 555dcf35a2 Add policy gradient example. (#344)
* add policy gradient example

* fix typos

* Minor changes plus some documentation.

* Minor fixes.
2017-03-07 23:42:44 -08:00