5812 Commits

Author SHA1 Message Date
Barak Michener e8a2bd0f24 Revert "Bump version number everywhere to 1.0.0"
This reverts commit fa304a90ee.
2020-09-21 17:58:09 +00:00
SangBin Cho 2fb29eb680 [Core] Fix Flaky GCS actor manager test (#10600)
* Try.

* Fix the issue.

* Fix.
2020-09-21 17:56:12 +00:00
SangBin Cho f1fed7f662 [Doc] Document options method (#10830)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-21 17:56:03 +00:00
SangBin Cho 37a81476ec [Doc] Broken doc build fix. (#10865) 2020-09-21 17:55:52 +00:00
SangBin Cho c79eb7984d [docs] Placement group documentation (#10555)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-21 17:55:24 +00:00
Max Fitton a6a7886529 [Dashboard] Refresh documentation 1.0.0 (#10684) 2020-09-21 17:55:01 +00:00
Max Fitton bde9c734e8 [Documentation] local_mode doc updates and actor / worker explanation from Slack (#10748)
* wip

* Update local mode docs in all locations

* Update doc/source/actors.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Update doc/source/actors.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Change duplicated text to links to a subtitle for local_mode

* change a reference to be explicit

* Apply suggestions from code review

Co-authored-by: Max Fitton <max@semprehealth.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
Resolved Conflicts:
	doc/source/actors.rst
2020-09-21 17:54:24 +00:00
Richard Liaw decaa6dea0 [docs] slurm + progress_bar example (#10782) 2020-09-21 17:52:07 +00:00
Kai Yang 9c65373085 Java doc: "Configuring Ray" page (#10801) 2020-09-21 17:48:42 +00:00
chaokunyang e756a2bbba [Java] Refine java driver log (#10794) 2020-09-21 17:48:31 +00:00
Hao Chen 96ab025e66 [Java] rename config ray.redis.address to ray.address (#10772)
Resolved Conflicts:
        java/test.sh
2020-09-21 17:48:03 +00:00
Alex Wu 3205119ccb [autoscaler] hotfix calculate_node_resources (#10874) 2020-09-21 17:45:28 +00:00
Eric Liang 5f190b4e18 [autoscaler] Usability improvements in logging (#10764) 2020-09-21 17:44:39 +00:00
Richard Liaw a9830b4dd3 [cli] make test failure less verbose + print ssh (#10767) 2020-09-21 17:44:28 +00:00
Richard Liaw 3ef55578af [cli] Remove extra wording + fix travis (#10726) 2020-09-21 17:44:19 +00:00
Yiran Wang bdac0ac380 [Autoscaler] Change poll interval to 5 sec when checking VMs status (#10462) 2020-09-21 17:44:07 +00:00
Sven Mika b79773531e [RLlib] Issue 10833 TorchPolicy GPU. (#10834) 2020-09-21 17:43:56 +00:00
Alex Wu 5bbfc548c1 [1.0] Remove args from ray start (#10659)
Resolved Conflicts:
        java/test.sh
        python/ray/tests/test_multi_node.py
2020-09-21 17:43:22 +00:00
chaokunyang 4b58557309 bump java version to 1.0.0 (#10796) 2020-09-18 11:09:13 +08:00
Eric Liang ce671b3a94 Restore plasma directory option (#10784) 2020-09-17 18:50:33 +00:00
Richard Liaw f60f5d8748 [docs] init list of oss projects (#10758) 2020-09-17 18:50:21 +00:00
Ameer Haj Ali 9db51d21c4 Fix abstraction violations in command_runner interface (#10715)
* Fix abstraction violations in command_runner interface

* user guide

* lint

* breaking abstraction in commands

* extra initialization commands

* more cleanup

* small fixes

* fix test_integration_kubernetes.py

* lint

Co-authored-by: root <root@ip-172-31-28-155.us-west-2.compute.internal>
Co-authored-by: Ameer Haj Ali <ameerhajali@Ameers-MacBook-Pro.local>
2020-09-17 18:49:36 +00:00
Kai Fricke 2d08b2bb1c [tune] convert fallback representation to numbers in wandb integration (#10799) 2020-09-17 18:49:17 +00:00
Amog Kamsetty 7bf5f1af8b [Ray SGD] use_local flag + Worker group abstraction (#10539)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-17 18:48:57 +00:00
Barak Michener 49ee123c85 Remove superfluous execution of java (#10750) 2020-09-14 17:19:30 +00:00
Stephanie Wang 2c17e9f575 Fix segfault in network utils (#10741) 2020-09-14 17:19:06 +00:00
chaokunyang a4b5922d5e [Java] remove native binary from ray_dist.jar (#10461) 2020-09-14 17:18:27 +00:00
Kai Yang edd9916e30 Fix Java CI crash caused by incorrect destruction order in core worker (#10709) 2020-09-14 17:17:34 +00:00
Alex Wu df77a31242 [Autoscaler] Unmanaged nodes (#10513) 2020-09-14 17:14:06 +00:00
Ian Rodney 049b7b2017 [docker] Revert to rsync & cp instead of file mount for bootstrap config/key (#10734) 2020-09-14 17:13:06 +00:00
Ian Rodney 0592dcacba [autoscaler] Fix rsync file mounts (#10721) 2020-09-14 17:12:33 +00:00
Ian Rodney 686c389562 [autoscaler] use default value (#10706) 2020-09-14 17:12:15 +00:00
Ian Rodney 826a9253c6 [docker] Detect CPUs in container correctly (#10507)
Co-authored-by: simon-mo <simon.mo@hey.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Alex Wu <itswu.alex@gmail.com>
2020-09-14 17:11:47 +00:00
Richard Liaw fe23f23680 [tune/rllib] revert removal of queue-trials (#10744) 2020-09-14 17:11:20 +00:00
Eric Liang 70305267d2 Remove colorful from ray core (#10723) 2020-09-14 17:09:56 +00:00
Alex Wu 72e19ede28 [hotfix] accelerator_types (#10725)
* .

* .
2020-09-14 17:09:28 +00:00
Alex Wu c4aaeab256 [Autoscaler] Fix utilization calc (#10728) 2020-09-14 17:08:58 +00:00
Alex Wu c2156c3ffa [hotfix] Autoscaler's K8 support (#10766)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-14 17:08:28 +00:00
Richard Liaw cb4ebb86c0 [autoscaler] make commands very explicit on logs (#10713) 2020-09-10 21:08:40 +00:00
Richard Liaw 9c6ab77d54 [autoscaler] Create provider exactly once (#10703)
Co-authored-by: Alex Wu <itswu.alex@gmail.com>
2020-09-10 21:08:23 +00:00
Kai Fricke 9bae286f42 [tune] wandb log cleaning to use yaml representer (#10680)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-10 21:07:49 +00:00
Barak Michener fa304a90ee Bump version number everywhere to 1.0.0 2020-09-09 18:35:14 +00:00
Max Fitton 3e8164ff8a [Dashboard] Logical View Actor Class Grouping Details (#10453)
* wip

* wip

* wip

* wip

* Need to track the timestamp actors are created for the dashboard. This adds that functionality back in and deletes unused code

* Add the materialui lab packages to get access to the Alert component and fix up some vulnerabilities with npm audit.

* Finish supporting information on a per-actor-class basis in the logical view, add bug fixes around timestamps and infeasible task names, and add a new warning popup that shows if there are infeasible actors around.

* lint and add seconds annotation to actor lifetime values

* real lint

* remove typo

* Somehow missed something last lint

* Add new comments for actor states

* Add underscores to some private functions

* Add tooltips to the actor states on the logical view

* change test metrics to be aligned with new changes.

* lint

* Remove some unnecessary log lines and catch error that happens when we try to decode data from an unexpected source

* Re-add a function I had removed. It is used in the Java codebase.

Co-authored-by: Max Fitton <max@semprehealth.com>
2020-09-09 10:34:54 -07:00
desktable 799318d7d7 [RLlib] Add type annotations for agents/dqn (#10626) 2020-09-09 18:55:26 +02:00
Richard Liaw 153813936b [tune] auto infer metrics (#10663)
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
Co-authored-by: Kai Fricke <kai@anyscale.com>
2020-09-09 09:53:47 -07:00
Richard Liaw 3501ea396c [tune] All examples to use ConcurrencyLimiter (#10662)
Co-authored-by: Kai Fricke <kai@anyscale.com>
2020-09-09 09:52:15 -07:00
Alex Wu cd5b99e5e0 [hotfix] redis_password -> _redis_password (#10672) 2020-09-09 09:40:49 -07:00
Sven Mika 4b278c36fc [RLlib] Behavioral Cloning (from MARWIL). (#10619) 2020-09-09 17:33:21 +02:00
Kai Yang afa0216280 Remove the '--include-java' option (#10594) 2020-09-09 17:01:17 +08:00
chaokunyang ccf27a9ad2 [Streaming] Fix streaming ci (#10665) 2020-09-09 16:53:43 +08:00