Commit Graph

136 Commits

Author SHA1 Message Date
Richard Liaw 48ba484640 [tune] Test TF2.0, TF1.14, TF1.12 Tensorboard support (#5931) 2019-10-18 13:50:42 -07:00
Richard Liaw d52a4983af Update TF documentation (#5918) 2019-10-16 01:31:27 -07:00
Richard Liaw 9f23620412 [tune] tf2.0 mnist example (#5898)
* tfmnistexample

* tfmnist

* add_to_ci

* format

* exampledownload

* fix
2019-10-15 22:25:01 -07:00
Edward Oakes abbfe7392f Bump dev version to 0.8.0.dev6 (#5906) 2019-10-14 11:36:13 +01:00
Richard Liaw 1650f7b174 [tune] Remove TF MNIST example + add TrialRunner hook to execut… (#5868)
* remove test

* add trial runner

* removerestore

* Remove other mnist examples

* tunetest

* revert

* v1

* Revert "v1"

This reverts commit c8bddaf2db7a8270c43c02021cac0e75df15ed20.

* Revert "revert"

This reverts commit b58f56884a0c288d3a6f997d149ab4d496ddd7a3.

* errors

* format
2019-10-13 20:33:56 -07:00
Robert Nishihara 523c764c25 Python 2 compatibility. (#5887) 2019-10-10 19:09:25 -07:00
Eric Liang 04e997fe0d Fix TF2 / rllib test (#5846) 2019-10-07 14:25:16 -07:00
Simon Mo e8570874b6 [Serve] Implement flask_request and named python request (#5849)
* Implement flask_request and named python request

* Forgot to include missing files

* Address comment

* Add flask to requirements for doc (lint failed)

* Update doc requirement so lint will build

* Install flask in CI

* Fix typo in .travis.yml
2019-10-06 15:12:30 -07:00
Anthony Yu b99cdf4e39 [tune] PBT + Memnn example (#5723)
* Add example file

* Move into train function

* Somewhat working example of MemNN, still has some failed trials

* Reorganize into a class

* Small fixes

* Iteration decrease and fix hyperparam_mutations

* Add example file

* Move into train function

* Somewhat working example of MemNN, still has some failed trials

* Reorganize into a class

* Small fixes

* Iteration decrease and fix hyperparam_mutations

* Some style edits

* Address PR changes without modifying learning rate

* Add configs and hyperparameter mutations

* Add tune test

* Modify import locations

* Some parameter changes for testing

* Update memnn example

* Add tensorboard support and address PR comment

* Final changes

* lint

* generator
2019-10-05 09:22:37 -07:00
Edward Oakes 972dddd776 [autoscaler] Kubernetes autoscaler backend (#5492)
* Add Kubernetes NodeProvider to autoscaler

* Split off SSHCommandRunner

* Add KubernetesCommandRunner

* Cleanup

* More config options

* Check if auth present

* More auth checks

* Better output

* Always bootstrap config

* All working

* Add k8s-rsync comment

* Clean up manual k8s examples

* Fix up submit.yaml

* Automatically configure permissions

* Fix get_node_provider arg

* Fix permissions

* Fill in empty auth

* Remove ray-cluster from this PR

* No hard dep on kubernetes library

* Move permissions into autoscaler config

* lint

* Fix indentation

* namespace validation

* Use cluster name tag

* Remove kubernetes from setup.py

* Comment in example configs

* Same default autoscaling config as aws

* Add Kubernetes quickstart

* lint

* Revert changes to submit.yaml (other PR)

* Install kubernetes in travis

* address comments

* Improve autoscaling doc

* kubectl command in setup

* Force use_internal_ips

* comments

* backend env in docs

* Change namespace config

* comments

* comments

* Fix yaml test
2019-10-03 10:17:00 -07:00
Edward Oakes ef1a61ab57 Log output in test_dead_actors.py (#5831) 2019-10-02 14:40:55 -07:00
Edward Oakes 443feb75f0 Fix test (#5810) 2019-09-30 19:39:53 -07:00
Richard Liaw baf85c6665 [tune/sgd] Fix Jenkins (#5765) 2019-09-27 09:59:08 -07:00
Eric Liang b5da32df78 Bump Ray version in documentation to dev5 (#5794) 2019-09-27 00:19:17 -07:00
Philipp Moritz 57a5871ea6 Convert long running stress tests to projects (#5641) 2019-09-26 11:25:09 -07:00
Eric Liang 5ecb02fb80 Release 0.7.5 updates (#5727) 2019-09-26 10:30:37 -07:00
Richard Liaw 10f21fa313 [docs] Convert Examples to Gallery (#5414) 2019-09-24 15:46:56 -07:00
Mitchell Stern 98dcc1d440 [Dashboard] Add initial version of new dashboard (#5730) 2019-09-23 08:50:40 -07:00
Robert Nishihara 1cfadf032e Properly test Python wheels in Travis. (#5749) 2019-09-21 18:03:10 -07:00
Richard Liaw e00071721a [tune] tf2.0 testing and supporting callables (#5738) 2019-09-21 17:01:14 -07:00
Simon Mo 5f88823c49 [Serve] Rewrite Ray.Serve From Scratch (#5562)
* Commit and format files

* address stylistic concerns

* Replace "Usage" by "Example" in doc

* Rename srv to serve

* Add serve to CI process; Fix 3.5 compat

* Improve determine_tests_to_run.py

* Quick cosmetic for determine_tests

* Address comments

* Address comments

* Address comment

* Fix typos and grammar

Co-Authored-By: Edward Oakes <ed.nmi.oakes@gmail.com>

* Update python/ray/experimental/serve/global_state.py

Co-Authored-By: Edward Oakes <ed.nmi.oakes@gmail.com>

* Use __init__ for Query and WorkIntent class

* Remove dataclasses dependency

* Rename oid to object_id for clarity

* Rename produce->enqueue_request, consume->dequeue_request

* Address last round of comment
2019-09-13 21:36:56 -07:00
Edward Oakes 07c4c6367a [core worker] Python core worker object interface (#5272) 2019-09-12 23:07:46 -07:00
Edward Oakes 6ab5714465 Temporarily remove pytest-sugar dependency (#5627) 2019-09-03 23:05:17 -07:00
jichan3751 1711e202a3 [training] Tensorflow interface for MultiNode SGD (#5440) 2019-09-03 15:35:42 -07:00
Eric Liang a101812b9f Replace --redis-address with --address in test, docs, tune, rllib (#5602)
* wip

* add tests and tune

* add ci

* test fix

* lint

* fix tests

* wip

* sugar dep
2019-09-01 16:53:02 -07:00
Richard Liaw 411f30c125 [docs] Second push of changes (#5391) 2019-08-28 17:54:15 -07:00
Eric Liang 97ccd75952 [rllib] Enable object store memory limit by default (#5534) 2019-08-26 01:37:28 -07:00
gehring b520f6141e [rllib] Adds eager support with a generic TFEagerPolicy class (#5436) 2019-08-23 14:21:11 +08:00
Richard Liaw cdc9227f1b [tune] ASHA xgboost and lightgbm examples (#5500) 2019-08-22 10:37:59 -07:00
Philipp Moritz e065f55998 Fix impala stress test (#5491) 2019-08-20 19:49:46 -07:00
Robert Nishihara 851c5b2dae Add a script for benchmarking performance for Ray developers. (#5472) 2019-08-19 23:41:23 -07:00
Richard Liaw d7b309223b [tune] MLFlow Logger (#5438) 2019-08-14 15:58:18 -07:00
Lisa Dunlap b7d0733362 [tune] Implement BOHB (#5382) 2019-08-13 12:32:07 -07:00
Eric Liang a1d2e17623 [rllib] Autoregressive action distributions (#5304) 2019-08-10 14:05:12 -07:00
jichan3751 de95117e96 [sgd] Tune interface for Pytorch MultiNode SGD (#5350) 2019-08-10 13:51:44 -07:00
Simon Mo 18f1e904de Bump 0.8.0.dev2 -> 0.8.0.dev3 (#5409) 2019-08-09 11:37:19 -07:00
Eric Liang 1a8fa5d2fa Clean up top level Ray dir (#5404) 2019-08-08 23:35:55 -07:00
Eric Liang 592f313210 [rllib] Centralized critic / PPO example on TwoStepGame (#5392) 2019-08-08 14:03:28 -07:00
Simon Mo 50b93bf179 Check upstream with git remote (#5377) 2019-08-06 17:30:33 -07:00
Wonseok Jeon 281829e712 MADDPG implementation in RLlib (#5348) 2019-08-06 16:22:06 -07:00
Eric Liang 5d7afe8092 [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
Richard Liaw 1eaa57c98f [tune] Distributed example + walkthrough (#5157) 2019-08-02 09:17:20 -07:00
Eric Liang 3bdd114282 [rllib] Better example rnn envs (#5300) 2019-07-28 14:07:18 -07:00
Eric Liang a62c5f40f6 [rllib] Document ModelV2 and clean up the models/ directory (#5277) 2019-07-27 02:08:16 -07:00
Richard Liaw 7e715520e5 [sgd] Example for Training (#5292) 2019-07-27 01:10:25 -07:00
Eric Liang f9043cc49a [rllib] Remove experimental eager support 2019-07-21 12:27:17 -07:00
Eric Liang d58b986858 [rllib] MultiCategorical shouldn't return array for kl or entropy (#5215)
* wip

* fix
2019-07-19 12:12:04 -07:00
Jones Wong 0af07bd493 Enable seeding actors for reproducible experiments (#5197)
* enable graph-level worker-specific seed

* lint checked

* revised according to Eric's suggestions

* revised accordingly and added a test case

* formatted

* Update test_reproducibility.py

* Update trainer.py

* Update rollout_worker.py

* Update run_rllib_tests.sh

* Update worker_set.py
2019-07-17 23:31:34 -07:00
Eric Liang 047f4ccd61 [rllib] Fix rollout.py with tuple action space (#5201)
* fix it

* update doc too

* fix rollout
2019-07-16 10:52:35 -07:00
Simon Mo fd71ffde2f Improve release process 0.7.2 (#5187) 2019-07-15 14:46:54 -07:00