Commit Graph

404 Commits

Author SHA1 Message Date
Eric Liang a62c5f40f6 [rllib] Document ModelV2 and clean up the models/ directory (#5277) 2019-07-27 02:08:16 -07:00
Eric Liang bf9199ad77 [rllib] ModelV2 support for pytorch (#5249) 2019-07-25 11:02:53 -07:00
Peter Schafhalter fc589050c9 [sgd] Deprecate old distributed SGD implementation (#5160)
* Deprecate old distributed SGD implementation

* Update README
2019-07-22 15:47:10 -07:00
Vince Jankovics 80b976efcb Ray namespace added for k8s (#4111)
* Ray namespace added for k8s

* Submit.yaml update with k8s namespace

* K8s deployment doc update with namespace
2019-07-22 15:45:05 -07:00
Eric Liang f9043cc49a [rllib] Remove experimental eager support 2019-07-21 12:27:17 -07:00
Eric Liang 047f4ccd61 [rllib] Fix rollout.py with tuple action space (#5201)
* fix it

* update doc too

* fix rollout
2019-07-16 10:52:35 -07:00
Richard Liaw b6509f46b0 Update wheels to 0.8.0dev2 (#5186) 2019-07-12 17:27:03 -07:00
Kai Yang 43b6513d19 [GCS] Move node resource info from client table to resource table (#5050) 2019-07-11 13:17:19 +08:00
Richard Liaw 0b540ab492 [tune] Test example checkpointing (#4728) 2019-07-10 01:58:26 -07:00
Eric Liang 34d054ff19 [rllib] ModelV2 API (#4926) 2019-07-03 15:59:47 -07:00
Kristian Hartikainen 9e0192bc0b [tune] Change the log syncing behavior (#4450)
* Change the log syncing behavior

* fix up abstractions for syncer

* Finished checkpoint syncing

* Code

* Set of changes to get things running

* Fixes for log syncing

* Fix parts

* Lint and other fixes

* fix some test

* Remove extra parsing functionality

* some test fixes

* Fix up cloud syncing

* Another thing to do

* Fix up tests and local sync

Changes LogSync into a mixin, and adds tests for different
functionalities.

* Fix up tests, start on local migration

* fix distributed migrations

* comments

* formatting

* Better checkpoint directory handling

* fix tests

* fix tests

* fix click

* comments

* formatting comments

* formatting and comments

* sync function deprecations

* syncfunction

* Add documentation for Syncing and Uploading

* nit

* BaseSyncer as base for Mixin in edge case

* more docs

* clean up assertions

* validate

* nit

* Update test_cluster.py

* betterdoc

* Update tune-usage.rst

* cleanup

* nit
2019-07-02 20:46:00 -07:00
Hao Chen 0131353d42 [gRPC] Migrate gcs data structures to protobuf (#5024) 2019-06-25 14:31:19 -07:00
Ashwinee Panda 11ccf66346 [docs] docs for running Tensorboard without sudo (#5015)
* Instructions for running Tensorboard without sudo

When we run Tensorboard to visualize the results of Ray outputs on multi-user clusters where we don't have sudo access, such as RISE clusters, a few commands need to first be run to make sure tensorboard can edit the tmp directory. This is a pretty common usecase so I figured we may as well put it in the documentation for Tune.

* Update tune-usage.rst
2019-06-24 11:26:53 -07:00
Tianhong Dai 1b86e551fb Fix bugs in the a3c code template. (#4984) 2019-06-14 17:22:36 -07:00
Eric Liang 77689d1116 [rllib] Port remainder of algorithms to build_trainer() pattern (#4920) 2019-06-07 16:45:36 -07:00
Eric Liang 9e328fbe6f [rllib] Add docs on how to use TF eager execution (#4927) 2019-06-07 16:42:37 -07:00
Yuhong Guo 5eff47b657 [C++] Add hash table to Redis-Module (#4911) 2019-06-07 16:11:37 +08:00
Stephanie Wang cbc67fc750 [doc] Update developer docs with bazel instructions (#4944) 2019-06-06 18:18:24 -07:00
Robert Nishihara c3f8fc1c44 Update version number in documentation after release 0.7.0 -> 0.7.1 and 0.8.0.dev0 -> 0.8.0.dev1. (#4941) 2019-06-06 17:22:45 -07:00
Timon Ruban 2702b15b04 [tune] Add requirements-dev.txt and update docs for contributing (#4925)
* Add requirements-dev.txt and update docs.

* Update doc/source/tune-contrib.rst

Co-Authored-By: Richard Liaw <rliaw@berkeley.edu>

* Unpin everything except for yapf.
2019-06-05 09:04:36 -07:00
Hersh Godse 89722ff003 [tune] Directional metrics for components (#4120) (#4915) 2019-06-02 22:13:40 -07:00
Eric Liang 7501ee51db [rllib] Rename PolicyEvaluator => RolloutWorker (#4820) 2019-06-03 06:49:24 +08:00
Peter Schafhalter c2ade075a3 [sgd] Distributed Training via PyTorch (#4797)
Implements distributed SGD using distributed PyTorch.
2019-06-01 21:39:22 -07:00
Eric Liang 9aa1cd613d [rllib] Allow Torch policies access to full action input dict in extra_action_out_fn (#4894)
* fix torch extra out

* preserve setitem

* fix docs
2019-06-01 16:58:49 +08:00
Eric Liang 1c073e92e4 [rllib] Fix documentation on custom policies (#4910)
* wip

* add docs

* lint

* todo sections

* fix doc
2019-06-01 16:13:21 +08:00
Eric Liang 2dd0beb5bd [rllib] Allow access to batches prior to postprocessing (#4871) 2019-05-29 18:17:14 -07:00
Eric Liang a45c61e19b [rllib] Update concepts docs and add "Building Policies in Torch/TensorFlow" section (#4821)
* wip

* fix index

* fix bugs

* todo

* add imports

* note on get ph

* note on get ph

* rename to building custom algs

* add rnn state info
2019-05-27 14:17:32 -07:00
Richard Liaw 574e1c7695 [tune] Fix up Ax Search and Examples (#4851)
* update Ax for cleaner API

* docs update
2019-05-27 13:23:17 -07:00
Robert Nishihara 6703519144 Move global state API out of global_state object. (#4857) 2019-05-26 11:27:53 -07:00
Devin Petersohn a7d01aba9b Update wheel versions in documentation to 0.8.0.dev0 and 0.7.0. (#4847) 2019-05-24 16:49:13 -07:00
Robert Nishihara 49fe894e22 Export remote functions when first used and also fix bug in which rem… (#4844)
* Export remote functions when first used and also fix bug in which remote functions and actor classes are not exported from workers during subsequent ray sessions.

* Documentation update

* Fix tests.

* Fix grammar
2019-05-24 13:44:39 -07:00
Eric Liang 02583a8598 [rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ (#4819)
This implements some of the renames proposed in #4813
We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch.
2019-05-20 16:46:05 -07:00
Philipp Moritz 84cf474abc Change tagline in documentation and README. (#4807)
* Update README.rst, index.rst, tutorial.rst and  _config.yml
2019-05-17 13:34:14 +08:00
Eric Liang 3807fb505b [rllib] TensorFlow 2 compatibility (#4802) 2019-05-16 22:12:07 -07:00
Eric Liang 7d5ef6d99c [rllib] Support continuous action distributions in IMPALA/APPO (#4771) 2019-05-16 22:05:07 -07:00
Adi Zimmerman f3b8b9093d Update tutorial link in doc (#4777) 2019-05-12 15:08:47 -07:00
Romil Bhardwaj 004440f526 Dynamic Custom Resources - create and delete resources (#3742) 2019-05-11 20:06:04 +08:00
Adi Zimmerman 28d381373d [tune] Add Ax to Tune (#4731) 2019-05-08 15:54:29 -07:00
Eric Liang 71b2dec3b4 [rllib] Fix bounds of space returned by preprocessor.observation_space (#4736) 2019-05-05 18:25:38 -07:00
Si-Yuan bd00735fe8 Fix tempfile issues (#4605) 2019-05-05 16:06:15 -07:00
Richard Liaw f2faf5ce75 [tune] Contributor Guide and Design Page (#4716)
* Move setup script out

* some changes

* Finished Contributor guide

* some comments to the design

* move

* Apply suggestions from code review

Co-Authored-By: richardliaw <rliaw@berkeley.edu>

* sourcecode

* comments
2019-05-05 00:04:13 -07:00
Adi Zimmerman 36b71d1446 [Tune] Post-Experiment Tools (#4351) 2019-05-04 02:51:26 -04:00
William Ma 406c429384 [doc] Remove instructions for installing Python2 for webui. (#4740) 2019-05-02 22:35:23 -07:00
Andrew Tan f87235f232 [tune] Example for Tune blog post (#4673) 2019-05-02 13:16:48 -04:00
Sam Toyer 663e92ab3f [rllib] TD3/DDPG improvements and MuJoCo benchmarks (#4694)
* [rllib] Separate optimisers for DDPG actor & crit.

* [rllib] Better names for DDPG variables & options

Config changes:

- noise_scale -> exploration_ou_noise_scale
- exploration_theta -> exploration_ou_theta
- exploration_sigma -> exploration_ou_sigma
- act_noise -> exploration_gaussian_sigma
- noise_clip -> target_noise_clip

* [rllib] Make DDPG less class-y

Used functions to replace three classes with only an __init__ method & a
handful of unrelated attributes.

* [rllib] Refactor DDPG noise

* [rllib] Unify DDPG exploration annealing

Added option "exploration_should_anneal" to enable linear annealing of
exploration noise. By default this is off, for consistency with DDPG &
TD3 papers. Also renamed "exploration_final_eps" to
"exploration_final_scale" (that name seems to have been carried over
from DQN, and doesn't really make sense here). Finally, tried to rename
"eps" to "noise_scale" wherever possible.
2019-04-26 17:49:53 -07:00
Andy Jones 05c896d6f7 Adds a note on how to avoid contention when using PyTorch. (#4692) 2019-04-26 13:22:26 -07:00
Eric Liang 6848dfd179 [rllib] Replace ray.get() with ray_get_and_free() to optimize memory usage (#4586) 2019-04-17 20:30:03 -04:00
Eric Liang 6e7680bf21 [rllib] Clean up concepts documentation and policy optimizer creation (#4592) 2019-04-12 21:03:26 -07:00
Eric Liang f8e8743347 [tune] Improve PBT example (#4575) 2019-04-09 20:59:17 -07:00
Eric Liang 4f46d3e9bf [rllib] Add multi-agent examples for hand-coded policy, centralized VF (#4554) 2019-04-09 00:36:49 -07:00