7243 Commits

Author SHA1 Message Date
Dominic Ming 752da83bb7 [Dashboard] Add the new dashboard code and prompt users to try it (#11667) 2021-01-29 15:22:26 +08:00
Stephanie Wang 42d501d747 [core] Pin arguments during task execution (#13737)
* tmp

* Pin task args

* unit tests

* update

* test

* Fix
2021-01-28 19:07:10 -08:00
Ian Rodney 813a7ab0e2 [docker] Build Python3.6 & Python3.8 Docker Images (#13548) 2021-01-28 15:24:50 -08:00
Tanja Bayer 0c906a8b93 [Docker] usage of python-version (#13011)
Co-authored-by: Tanja Bayer <tanja.bayer@widas.de>
Co-authored-by: Ian Rodney <ian.rodney@gmail.com>
2021-01-28 14:27:54 -08:00
architkulkarni cb771f263d [Serve] Add ServeHandle metrics (#13640) 2021-01-28 14:40:47 -06:00
Sven Mika 4bc257f4fb [RLlib] Fix custom multi action distr (#13681) 2021-01-28 19:28:48 +01:00
Lena Kashtelyan c583113d66 [Ax] Align optimization mode and reported SEM with Ax (#13611)
* [Ax] Align optimization mode and reported SEM with Ax

Ensure that `mode` aligns with the mode set in Ax + report SEM as None rather than as 0.0 to make use of Ax noise inference

* Account for review

* Update ax.py

* Fix lint

* Fix tests, ad additional checks

* Fix tests for python 3.6

Co-authored-by: Kai Fricke <kai@anyscale.com>
2021-01-28 19:01:51 +01:00
Yuri Rocha b01b0f80aa [RLlib] Fix multiple Unity3DEnvs trying to connect to the same custom port (#13519) 2021-01-28 13:28:08 +01:00
cathrinS d4ef5c5993 [RLlib] Atari-RAM-Preprocessing, unsigned observation vector results in a false preprocessed observation (#13013) 2021-01-28 12:07:00 +01:00
Tao Wang 56ee6ef55f [GCS]only update states related fields when publish actor table data (#13448) 2021-01-28 11:12:57 +08:00
architkulkarni cb95ff1e56 [Serve] Add "endpoint registered" message to router log (#13752) 2021-01-27 19:03:15 -08:00
Simon Mo 4f1f558802 [Core] Hotfix Windows Compilation Error for ClusterTaskManager (#13754)
* [Core] Hotfix Windows Compilation Error for ClusterTaskManager

* fix
2021-01-27 19:01:56 -08:00
Simon Mo c10abbb1bb Revert "[Serve] Fix ServeHandle serialization (#13695)" (#13753)
This reverts commit 202fbdf38c.
2021-01-27 17:47:42 -08:00
Eric Liang 2e01d5d26e Report failed deserialization of errors in Ray client 2021-01-27 17:37:50 -08:00
Zhe Zhang 0e7343ec19 [docs] Fix MLflow / Tune example in documentation (#13740)
Minor fixes to make it runnable
2021-01-27 17:16:29 -08:00
Dmitri Gekhtman 40234ad631 [autoscaler][AWS] Make sure subnets belong to same VPC as user-specified security groups (#13558)
* initial commit

* Filter subnets by security groups' VPCs

* fix stubs

* wip

* Fix inbound rule logic. Tests WIP.

* wip

* unit test

* example yaml

* Unit test tests for bug being fixed

* Update python/ray/tests/aws/utils/constants.py

Co-authored-by: Thomas Desrosiers <681004+thomasdesr@users.noreply.github.com>

Co-authored-by: Thomas Desrosiers <681004+thomasdesr@users.noreply.github.com>
2021-01-27 17:00:52 -08:00
architkulkarni 28cf5f91e3 [docs] change MLFlow to MLflow in docs (#13739) 2021-01-27 16:53:15 -08:00
Simon Mo 25fa391193 [Core] Add private on_completed callback for ObjectRef (#13688) 2021-01-27 16:32:00 -08:00
SangBin Cho 32ec0d205f [Object Spilling] Remove job id from the io worker log name. (#13746) 2021-01-27 16:26:32 -08:00
Ian Rodney bdf0c00989 Revert "Revert "[CLI] Fix Ray Status with ENV Variable set (#13707) (#13726) 2021-01-27 15:33:33 -08:00
Alex Wu c0fe816466 [Core/Autoscaler] Properly clean up resource backlog from (#13727) 2021-01-27 15:30:58 -08:00
Simon Mo 3644df415a [CI] Add retry to java doc test (#13743) 2021-01-27 14:18:06 -08:00
Eric Liang 56a9523020 Fix high CPU usage in object manager due to O(n^2) iteration over active pulls list (#13724) 2021-01-27 14:02:22 -08:00
Ian Rodney c5209e2dab [Docker] default to /home/ray (#13738) 2021-01-27 13:46:07 -08:00
Ian Rodney b4bcb9b60a [Docker] Use Cuda 11 (#13691) 2021-01-27 13:45:30 -08:00
Eric Liang eba698d48e Remove docs for install-nightly (#13744) 2021-01-27 13:10:45 -08:00
architkulkarni 202fbdf38c [Serve] Fix ServeHandle serialization (#13695) 2021-01-27 14:11:31 -06:00
Edward Oakes 06fac785b8 [serve] Fix whacky worker replica failure test (#13696) 2021-01-27 14:05:37 -06:00
Clark Zinzow 2d34e95c93 Don't gather check_parent_task on Windows, since it's undefined. (#13700) 2021-01-27 09:19:58 -08:00
Kai Fricke c5b645e3da [tune] add type hints to tune.run(), fix abstract methods of ProgressReporter (#13684) 2021-01-27 16:43:50 +01:00
Kai Fricke 2664a2a8f6 [tune] fix non-deterministic category sampling by switching back to np.random.choice (#13710)
* Enable zoopt tests again, but wait for next release

* Add test and preserve state in trial executor

* Add baseline check with integers

* [tune] fix non-deterministic category sampling, re-enable zoopt tests

* Remove random import

* Disable zoopt tests
2021-01-27 16:42:44 +01:00
DK.Pino 7f6d326ad8 [Placement Group]Add detached support for placement group. (#13582) 2021-01-27 18:51:26 +08:00
SangBin Cho d2963f4ee1 [Object Spilling] Clean up FS storage upon sigint for ray.init(). (#13649)
* Initial iteration done.

* Remove unnecessary messages.

* Addressed code review.

* Addressed code review.

* fix issues.

* addressed code review.

* Addressed the last code review.
2021-01-26 23:10:29 -08:00
SangBin Cho 8baafacb1e [Logging] Log rotation config (#13375)
* In Progress.

* formatting.

* in progress.

* linting.

* Done.

* Fix typo.

* Fixed the issue.
2021-01-26 20:15:55 -08:00
Simon Mo 9cf0c49015 [CI] Skip test_multi_node_3 on Windows (#13723)
test_multi_node_3 was recently split from test_multi_node, but we forgot
to skip it on Windows
2021-01-26 16:12:13 -08:00
Ian Rodney 4db0a31130 [Core] Better error if /dev/shm is too small (#13624) 2021-01-26 15:26:45 -08:00
Rand Xie 4f4e1b664b Fix multiprocessing starmap to allow passing in zip (#13664) 2021-01-26 16:15:35 -06:00
Simon Mo 2f482193b9 Revert "[CLI] Fix Ray Status with ENV Variable set (#13707)" (#13719)
This reverts commit 5d82654022.
2021-01-26 14:14:51 -08:00
Ian Rodney ab6a634a94 [Serve] Revert "Revert "[Serve] Refactor BackendState" (#13626) (#13697) 2021-01-26 15:31:01 -06:00
Barak Michener f490e2be43 [ray_client] Fix and extend get_actor test to detached actors (#13016) 2021-01-26 15:19:51 -06:00
Amog Kamsetty 6b477dd37a [CI] Split test_multi_node to avoid timeouts (#13712) 2021-01-26 12:06:19 -08:00
Barak Michener 0c46d09940 [ray_client]: Monitor client stream errors (#13386) 2021-01-26 10:56:56 -08:00
Ian Rodney 5d82654022 [CLI] Fix Ray Status with ENV Variable set (#13707) 2021-01-26 10:29:42 -08:00
Dmitri Gekhtman ddcbd229ba Rename the ray.operator module to ray.ray_operator (#13705)
* Rename ray.operator module

* mypy
2021-01-26 10:29:07 -08:00
Amog Kamsetty 4aff86bfa7 [CI] skip failing java tests (#13702) 2021-01-26 10:17:58 -08:00
Edward Oakes 5d882b062d [Serve] fix k8s doc (#13713) 2021-01-26 10:09:13 -08:00
dependabot[bot] 148b1022d6 [tune](deps): Bump autogluon-core in /python/requirements (#13698)
Bumps [autogluon-core](https://github.com/awslabs/autogluon) from 0.0.16b20210122 to 0.0.16b20210125.
- [Release notes](https://github.com/awslabs/autogluon/releases)
- [Changelog](https://github.com/awslabs/autogluon/blob/master/docs/ReleaseInstructions.md)
- [Commits](https://github.com/awslabs/autogluon/commits)

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-01-26 11:32:56 +01:00
dependabot[bot] ef1f7e4d42 [tune](deps): Bump smart-open[s3] in /python/requirements (#13699)
Bumps [smart-open[s3]](https://github.com/piskvorky/smart_open) from 4.0.1 to 4.1.2.
- [Release notes](https://github.com/piskvorky/smart_open/releases)
- [Changelog](https://github.com/RaRe-Technologies/smart_open/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/piskvorky/smart_open/compare/4.0.1...v4.1.2)

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-01-26 11:32:17 +01:00
Hao Zhang 7a78f4e959 [Collective][PR 4/6] NCCL Communicator caching and preliminary stream management (#13030)
Co-authored-by: Dacheng Li <dal177@ucsd.edu>
2021-01-26 01:05:21 -08:00
Alex Wu 840987c7af Scalability Envelope Tests (#13464) 2021-01-25 18:48:31 -08:00