373 Commits

Author SHA1 Message Date
Eric Liang 02583a8598 [rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ (#4819)
This implements some of the renames proposed in #4813
We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch.
2019-05-20 16:46:05 -07:00
Philipp Moritz 84cf474abc Change tagline in documentation and README. (#4807)
* Update README.rst, index.rst, tutorial.rst and _config.yml
2019-05-17 13:34:14 +08:00
Eric Liang 3807fb505b [rllib] TensorFlow 2 compatibility (#4802) 2019-05-16 22:12:07 -07:00
Eric Liang 7d5ef6d99c [rllib] Support continuous action distributions in IMPALA/APPO (#4771) 2019-05-16 22:05:07 -07:00
Adi Zimmerman f3b8b9093d Update tutorial link in doc (#4777) 2019-05-12 15:08:47 -07:00
Romil Bhardwaj 004440f526 Dynamic Custom Resources - create and delete resources (#3742) 2019-05-11 20:06:04 +08:00
Adi Zimmerman 28d381373d [tune] Add Ax to Tune (#4731) 2019-05-08 15:54:29 -07:00
Eric Liang 71b2dec3b4 [rllib] Fix bounds of space returned by preprocessor.observation_space (#4736) 2019-05-05 18:25:38 -07:00
Si-Yuan bd00735fe8 Fix tempfile issues (#4605) 2019-05-05 16:06:15 -07:00
Richard Liaw f2faf5ce75 [tune] Contributor Guide and Design Page (#4716)
* Move setup script out

* some changes

* Finished Contributor guide

* some comments to the design

* move

* Apply suggestions from code review

Co-Authored-By: richardliaw <rliaw@berkeley.edu>

* sourcecode

* comments
2019-05-05 00:04:13 -07:00
Adi Zimmerman 36b71d1446 [Tune] Post-Experiment Tools (#4351) 2019-05-04 02:51:26 -04:00
William Ma 406c429384 [doc] Remove instructions for installing Python2 for webui. (#4740) 2019-05-02 22:35:23 -07:00
Andrew Tan f87235f232 [tune] Example for Tune blog post (#4673) 2019-05-02 13:16:48 -04:00
Sam Toyer 663e92ab3f [rllib] TD3/DDPG improvements and MuJoCo benchmarks (#4694)
* [rllib] Separate optimisers for DDPG actor & crit.

* [rllib] Better names for DDPG variables & options

Config changes:

- noise_scale -> exploration_ou_noise_scale
- exploration_theta -> exploration_ou_theta
- exploration_sigma -> exploration_ou_sigma
- act_noise -> exploration_gaussian_sigma
- noise_clip -> target_noise_clip

* [rllib] Make DDPG less class-y

Used functions to replace three classes with only an __init__ method & a
handful of unrelated attributes.

* [rllib] Refactor DDPG noise

* [rllib] Unify DDPG exploration annealing

Added option "exploration_should_anneal" to enable linear annealing of
exploration noise. By default this is off, for consistency with DDPG &
TD3 papers. Also renamed "exploration_final_eps" to
"exploration_final_scale" (that name seems to have been carried over
from DQN, and doesn't really make sense here). Finally, tried to rename
"eps" to "noise_scale" wherever possible.
2019-04-26 17:49:53 -07:00
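The option renames listed in the commit above (#4694) can be applied to an existing config dict mechanically. The following is a minimal sketch, assuming the config is a plain Python dict; `migrate_ddpg_config` and `RENAMED_KEYS` are hypothetical names introduced here for illustration and are not part of RLlib. The example values are arbitrary, not defaults.

```python
# Old -> new DDPG/TD3 option names, copied from the commit message above.
# The last entry comes from the "exploration annealing" paragraph.
RENAMED_KEYS = {
    "noise_scale": "exploration_ou_noise_scale",
    "exploration_theta": "exploration_ou_theta",
    "exploration_sigma": "exploration_ou_sigma",
    "act_noise": "exploration_gaussian_sigma",
    "noise_clip": "target_noise_clip",
    "exploration_final_eps": "exploration_final_scale",
}


def migrate_ddpg_config(config):
    """Return a copy of `config` with old option names replaced by new ones.

    Hypothetical helper (not an RLlib function). Keys that were not renamed
    are copied unchanged. Supplying both the old and the new spelling of the
    same option is treated as an error rather than silently picking one.
    """
    migrated = {}
    for key, value in config.items():
        new_key = RENAMED_KEYS.get(key, key)
        if new_key in migrated:
            raise ValueError(
                "option %r given under both its old and new name" % new_key
            )
        migrated[new_key] = value
    return migrated


# Arbitrary illustrative values, not RLlib defaults.
old_config = {"noise_scale": 0.1, "act_noise": 0.2, "gamma": 0.99}
new_config = migrate_ddpg_config(old_config)
print(new_config)
```

The helper returns a new dict and leaves its input untouched, so the old config can still be inspected after migration.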
Andy Jones 05c896d6f7 Adds a note on how to avoid contention when using PyTorch. (#4692) 2019-04-26 13:22:26 -07:00
Eric Liang 6848dfd179 [rllib] Replace ray.get() with ray_get_and_free() to optimize memory usage (#4586) 2019-04-17 20:30:03 -04:00
Eric Liang 6e7680bf21 [rllib] Clean up concepts documentation and policy optimizer creation (#4592) 2019-04-12 21:03:26 -07:00
Eric Liang f8e8743347 [tune] Improve PBT example (#4575) 2019-04-09 20:59:17 -07:00
Eric Liang 4f46d3e9bf [rllib] Add multi-agent examples for hand-coded policy, centralized VF (#4554) 2019-04-09 00:36:49 -07:00
Eric Liang 37208216ae [rllib] Rename Agent to Trainer (#4556) 2019-04-07 00:36:18 -07:00
Andrew Tan 991b911e1d [tune] Add --columns flag for CLI (#4564) 2019-04-05 19:49:01 -07:00
Andrew Tan bfd0af52bc [tune] Add documentation to --output flag (#4518)
## What do these changes do?

Add documentation for the `--output` flag for ls / lsx in the Tune CLI.

## Related issue number

Closes #4511 

## Linter

- [x] I've run `scripts/format.sh` to lint the changes in this PR.
2019-04-05 00:16:35 -07:00
Tasha Chin 5693cd1344 [docs] Show source code (#3281) 2019-04-03 21:30:20 -07:00
Yuhong Guo c2349cf12d Remove local/global_scheduler from code and doc. (#4549) 2019-04-03 17:05:09 -07:00
Adi Zimmerman 51dae23d5c [tune] Search Alg delay import + CLI timing test (#4230) 2019-04-03 08:52:45 -07:00
Philipp Moritz b0f6ddf6d1 Remove CMake files (#4493) 2019-04-02 22:17:33 -07:00
Eric Liang fce0062380 [rllib] Switch to tune.run() instead of run_experiments() (#4515) 2019-03-30 14:07:50 -07:00
bjg2 77005d1814 [rllib] Make batch timeout for remote workers tunable (#4435) 2019-03-29 13:19:42 -07:00
Eric Liang 2871609296 [rllib] Report sampler performance metrics (#4427) 2019-03-27 13:24:23 -07:00
Andrew Tan 12db684f72 [tune] add filter flag for Tune CLI (#4337)
## What do these changes do?

Adds filter flag (--filter) to ls / lsx commands for Tune CLI.

Usage: `tune ls [path] --filter [column] [operator] [value]`
e.g. `tune lsx ~/ray_results/my_project --filter total_trials == 1`
2019-03-27 11:19:25 -07:00
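The `--filter [column] [operator] [value]` form described in the commit above (#4337) amounts to applying one comparison per row. The following is a minimal sketch of that idea, assuming rows are plain dicts and that the usual six comparison operators are accepted; `filter_rows`, `parse_value`, and the `OPERATORS` table are hypothetical names for illustration and do not claim to match the Tune CLI's actual implementation.

```python
import operator

# Assumed operator set; the commit message only shows `==` in its example.
OPERATORS = {
    "<": operator.lt,
    "<=": operator.le,
    "==": operator.eq,
    "!=": operator.ne,
    ">=": operator.ge,
    ">": operator.gt,
}


def parse_value(text):
    """Interpret a command-line value as a number when possible, else a string."""
    try:
        return float(text)
    except ValueError:
        return text


def filter_rows(rows, column, op, value):
    """Keep the rows whose `column` satisfies `<column> <op> <value>`.

    Rows that lack the column, or whose value cannot be compared with the
    target (for example a string against a number with `<`), are dropped.
    """
    if op not in OPERATORS:
        raise ValueError("unsupported operator: %r" % op)
    compare = OPERATORS[op]
    target = parse_value(value)
    kept = []
    for row in rows:
        if column not in row:
            continue
        try:
            if compare(row[column], target):
                kept.append(row)
        except TypeError:
            # Incomparable types: treat as "does not match".
            continue
    return kept


# Illustrative data only; the column names are made up for this sketch.
rows = [
    {"name": "exp_a", "total_trials": 1},
    {"name": "exp_b", "total_trials": 4},
]
print(filter_rows(rows, "total_trials", "==", "1"))
```

Command-line arguments always arrive as strings, which is why the sketch tries a numeric interpretation first before falling back to string comparison.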
Robert Nishihara c6f12e5219 Update documentation from 0.7.0.dev1 to 0.7.0.dev2. (#4485) 2019-03-26 17:32:53 -07:00
Eric Liang cff08e19ff [rllib] Print out intermediate data shapes on the first iteration (#4426) 2019-03-26 00:27:59 -07:00
Eric Liang 5b8eb475ce [rllib] Allow None to be specified in multi-agent envs (#4464)
* wip

* check

* doc update

* Update hierarchical_training.py
2019-03-25 11:38:17 -07:00
Eric Liang 4b8b703561 [rllib] Some API cleanups and documentation improvements (#4409) 2019-03-21 21:34:22 -07:00
Philipp Moritz 80ef8c19aa Add initial news reader example (#4348) 2019-03-20 18:47:12 -07:00
Richard Liaw ea5a6f8455 [tune] Simplify API (#4234)
Uses `tune.run` to execute experiments as preferred API.

@noahgolmant

This does not break backwards compat, but will slowly internalize `Experiment`. 

In a separate PR, Tune schedulers should only support 1 running experiment at a time.
2019-03-17 13:03:32 -07:00
Philipp Moritz c5e2c9af4d Build wheels for macOS with Bazel (#4280) 2019-03-15 10:37:57 -07:00
Richard Liaw 6630a35353 [tune] Initial Commit for Tune CLI (#3983)
This introduces a light CLI for Tune.
2019-03-08 16:46:05 -08:00
Eric Liang c7f74dbdc7 [rllib] Add async remote workers (#4253) 2019-03-08 15:39:48 -08:00
Philipp Moritz 39eed24d47 update version from 0.7.0.dev0 to 0.7.0.dev1 (#4282) 2019-03-06 14:43:09 -08:00
Eric Liang 78ad9c4cbb Add "ray timeline" command to auto-dump Chrome trace for the current Ray instance (#4239) 2019-03-05 16:28:00 -08:00
Adi Zimmerman 4cf2c9ecb8 [tune] Doc fixes (#4207)
Co-Authored-By: adizim <adizim@berkeley.edu>
2019-03-05 14:11:53 -08:00
Eric Liang 6e3384a719 [rllib] Add three new long-running stress tests {APEX, IMPALA, PBT} (#4215) 2019-03-04 14:05:42 -08:00
Eric Liang ba03048254 [rllib] TF model custom_loss() should actually allow access to full rollout data (#4220) 2019-03-02 22:57:51 -08:00
Robert Nishihara 4b89eebfc7 Move test folders under rllib/tune from test -> tests. (#4214) 2019-03-02 13:37:16 -08:00
Robert Nishihara f21e6a2cff Update documentation regarding UI and timeline. (#4189) 2019-03-01 19:54:33 -08:00
Richard Liaw c695402dc3 [tune] Introduce ability to turn off default logging. (#4104) 2019-02-28 17:02:41 -08:00
Ion 44afcf4fa8 [doc] Document experimental signal API. (#4019)
* [doc] Document signal API.

* minor

* resolve conflicts
2019-02-28 11:05:28 -08:00
Robert Nishihara 641f703879 Update installation instructions to include bazel and remove outdated… (#4171) 2019-02-26 23:07:43 -08:00
Adi Zimmerman 5cf388f29d [tune] Support RESTful API for the Web Server (#4080)
Change the client/server API to a RESTful design. This includes resource modeling, model URIs, and correct HTTP methods.
2019-02-26 21:56:02 -08:00