111 Commits

Author SHA1 Message Date
Richard Liaw 411f30c125 [docs] Second push of changes (#5391) 2019-08-28 17:54:15 -07:00
Eric Liang 97ccd75952 [rllib] Enable object store memory limit by default (#5534) 2019-08-26 01:37:28 -07:00
gehring b520f6141e [rllib] Adds eager support with a generic TFEagerPolicy class (#5436) 2019-08-23 14:21:11 +08:00
Richard Liaw cdc9227f1b [tune] ASHA xgboost and lightgbm examples (#5500) 2019-08-22 10:37:59 -07:00
Philipp Moritz e065f55998 Fix impala stress test (#5491) 2019-08-20 19:49:46 -07:00
Robert Nishihara 851c5b2dae Add a script for benchmarking performance for Ray developers. (#5472) 2019-08-19 23:41:23 -07:00
Richard Liaw d7b309223b [tune] MLFlow Logger (#5438) 2019-08-14 15:58:18 -07:00
Lisa Dunlap b7d0733362 [tune] Implement BOHB (#5382) 2019-08-13 12:32:07 -07:00
Eric Liang a1d2e17623 [rllib] Autoregressive action distributions (#5304) 2019-08-10 14:05:12 -07:00
jichan3751 de95117e96 [sgd] Tune interface for Pytorch MultiNode SGD (#5350) 2019-08-10 13:51:44 -07:00
Simon Mo 18f1e904de Bump 0.8.0.dev2 -> 0.8.0.dev3 (#5409) 2019-08-09 11:37:19 -07:00
Eric Liang 1a8fa5d2fa Clean up top level Ray dir (#5404) 2019-08-08 23:35:55 -07:00
Eric Liang 592f313210 [rllib] Centralized critic / PPO example on TwoStepGame (#5392) 2019-08-08 14:03:28 -07:00
Simon Mo 50b93bf179 Check upstream with git remote (#5377) 2019-08-06 17:30:33 -07:00
Wonseok Jeon 281829e712 MADDPG implementation in RLlib (#5348) 2019-08-06 16:22:06 -07:00
Eric Liang 5d7afe8092 [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
Richard Liaw 1eaa57c98f [tune] Distributed example + walkthrough (#5157) 2019-08-02 09:17:20 -07:00
Eric Liang 3bdd114282 [rllib] Better example rnn envs (#5300) 2019-07-28 14:07:18 -07:00
Eric Liang a62c5f40f6 [rllib] Document ModelV2 and clean up the models/ directory (#5277) 2019-07-27 02:08:16 -07:00
Richard Liaw 7e715520e5 [sgd] Example for Training (#5292) 2019-07-27 01:10:25 -07:00
Eric Liang f9043cc49a [rllib] Remove experimental eager support 2019-07-21 12:27:17 -07:00
Eric Liang d58b986858 [rllib] MultiCategorical shouldn't return array for kl or entropy (#5215)
* wip

* fix
2019-07-19 12:12:04 -07:00
Jones Wong 0af07bd493 Enable seeding actors for reproducible experiments (#5197)
* enable graph-level worker-specific seed

* lint checked

* revised according to eric's suggestions

* revised accordingly and added a test case

* formated

* Update test_reproducibility.py

* Update trainer.py

* Update rollout_worker.py

* Update run_rllib_tests.sh

* Update worker_set.py
2019-07-17 23:31:34 -07:00
Eric Liang 047f4ccd61 [rllib] Fix rollout.py with tuple action space (#5201)
* fix it

* update doc too

* fix rollout
2019-07-16 10:52:35 -07:00
Simon Mo fd71ffde2f Improve release process 0.7.2 (#5187) 2019-07-15 14:46:54 -07:00
Richard Liaw b6509f46b0 Update wheels to 0.8.0dev2 (#5186) 2019-07-12 17:27:03 -07:00
Philipp Moritz ccee77aafd fix node_failures.py (#5167) 2019-07-11 11:40:13 -07:00
Hao Chen 0c34749779 Use bazel disk cache for all CI jobs (#5144) 2019-07-10 22:03:45 +08:00
Richard Liaw 0b540ab492 [tune] Test example checkpointing (#4728) 2019-07-10 01:58:26 -07:00
Eric Liang 5ab5017c67 [rllib] Fix impala stress test (#5101)
* add copy

* upgrade to tf 1.14

* update

* reduce count to workaround https://github.com/ray-project/ray/issues/5125

* Update impala.py

* placeholder

* comments

* update
2019-07-09 20:22:30 -07:00
Joey Jiang b4e51c8aa1 Support clang-format whose version is not 7.0 (#5139) 2019-07-08 17:15:09 +08:00
Robert Nishihara 9cc4cc6a52 Fail format.sh if yapf/flake8 versions are incorrect. (#5083) 2019-07-04 23:22:01 -07:00
Eric Liang 34d054ff19 [rllib] ModelV2 API (#4926) 2019-07-03 15:59:47 -07:00
Eric Liang 904dcf081d Switch cluster longevity tests to DLAMI, fix ray up verbosity (#5084)
* fix

* add branch commit

* comments

* Update ci/long_running_tests/.gitignore

Co-Authored-By: Robert Nishihara <robertnishihara@gmail.com>
2019-07-02 00:19:05 -07:00
Philipp Moritz a6a02fccd0 Do not compile redis twice (#5074) 2019-07-01 15:42:54 -07:00
Simon Mo 0c4dd3c401 Use bazel disk cache with travis (#5068) 2019-06-30 17:57:48 -07:00
Robert Nishihara bcc379556b Make some fixes to long running stress tests. (#5056) 2019-06-28 15:42:54 -07:00
Richard Liaw b1827d5fbe [tune] Update MNIST Example (#4991) 2019-06-25 22:50:15 -07:00
Richard Liaw bd8aceb896 [ci] Change Jenkins to py3 (#5022)
* conda3

* integration

* add nevergrad, remotedata

* pytest 0.3.1

* otherdockers

* setup

* tune
2019-06-24 21:50:37 -07:00
Hao Chen 2bf92e02e2 [gRPC] Use gRPC for inter-node-manager communication (#4968) 2019-06-17 19:00:50 +08:00
Hao Chen 3c92b2ee4d Upgrade CI clang-format to 6.0 (#4976) 2019-06-14 14:52:32 +08:00
Robert Nishihara 6f48992322 Make release stress tests work and improve them. (#4955) 2019-06-10 23:04:01 -07:00
Eric Liang 9e328fbe6f [rllib] Add docs on how to use TF eager execution (#4927) 2019-06-07 16:42:37 -07:00
Robert Nishihara c3f8fc1c44 Update version number in documentation after release 0.7.0 -> 0.7.1 and 0.8.0.dev0 -> 0.8.0.dev1. (#4941) 2019-06-06 17:22:45 -07:00
Hersh Godse 89722ff003 [tune] Directional metrics for components (#4120) (#4915) 2019-06-02 22:13:40 -07:00
Eric Liang 7501ee51db [rllib] Rename PolicyEvaluator => RolloutWorker (#4820) 2019-06-03 06:49:24 +08:00
Peter Schafhalter c2ade075a3 [sgd] Distributed Training via PyTorch (#4797)
Implements distributed SGD using distributed PyTorch.
2019-06-01 21:39:22 -07:00
Eric Liang 1c073e92e4 [rllib] Fix documentation on custom policies (#4910)
* wip

* add docs

* lint

* todo sections

* fix doc
2019-06-01 16:13:21 +08:00
Eric Liang d7be5a5d36 [rllib] Fix error getting kl when simple_optimizer: True in multi-agent PPO 2019-05-27 17:24:45 -07:00
Robert Nishihara 7a78e1e320 Install bazel in autoscaler development configs. (#4874) 2019-05-26 16:13:50 -07:00