Commit Graph

3346 Commits

Author SHA1 Message Date
Tao Wang b85c6abc3e Rename fields/variables from client id to node id (#12457) 2020-11-30 14:33:36 +08:00
SangBin Cho 3964defbe1 [Logging] Fix tensorflow logging issue. (#12225)
* in progress.

* ip

* In Progress

* done.

* fix lint.

* Addressed code review

* Addressed code review.
2020-11-29 22:16:52 -08:00
SangBin Cho 91d54ef621 [Core] Remove actor arg from executor to allow users to specify actor… (#12239)
* [Core] Remove actor arg from executor to allow users to specify actor arg in their Actor.remote.

* Addressed code review.
2020-11-29 22:15:48 -08:00
chaokunyang 17a6b9bbe7 Fix not cp jars (#12456) 2020-11-30 13:53:09 +08:00
Philipp Moritz cf73ccddae Allow more fields for object metadata (#12484) 2020-11-29 21:50:18 -08:00
Alex Wu f1cc33a6a6 Actor resource backlog hotfix (#12471)
* prepare implemented

* works?

* deflek

* git

* deflek round 2

* .

* improve the test

Co-authored-by: Alex <alex@anyscale.com>
Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-11-29 20:55:50 -08:00
Amog Kamsetty 8a406e1f9a [SGD] Add PTL Docs (#12440)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-28 10:09:38 -08:00
Kai Fricke 1d0ade1b93 Revert "[Core] zero-copy serializer for pytorch (#12344)" (#12469)
This reverts commit 0a505ca8
2020-11-28 10:06:02 -08:00
Eric Liang 569eee5e71 Enable more new scheduler tests (#12421) 2020-11-27 16:10:38 -08:00
Richard Liaw 7c009d22cf [docs] Add xgboost_ray to docs (#12184)
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2020-11-27 11:36:56 -08:00
Siyuan (Ryans) Zhuang 0a505ca83d [Core] zero-copy serializer for pytorch (#12344)
* zero-copy serializer for pytorch

* address possible bottleneck

* add tests & device support
2020-11-26 16:09:54 -08:00
Amog Kamsetty e0573df337 [CI] Fix windows build (#12415)
* attempt to fix windows

* fix syntax

* try again

* try again

* try again

* Revert "[ray_client] Support calling functions from other functions and correct the tests (#12141)"

This reverts commit 4066056a0d.

* Revert

* Revert "Revert "[ray_client] Support calling functions from other functions and correct the tests (#12141)""

This reverts commit bb27b87b6c8d780ad796f4d4aeaa20113c8eca79.

* please work

* works

* fix
2020-11-26 10:52:11 -08:00
Sven Mika c1d7826bb7 [RLlib] Move pettingzoo from requirements.txt into requirements_rllib.txt (#12400) 2020-11-26 19:30:35 +01:00
Ameer Haj Ali 9ccf5f6ccc [ray client] add metadata and secure options to Worker. (#12409) 2020-11-25 17:48:13 -08:00
Richard Liaw 323941c745 [tune] fix pbt flakey test (#12418) 2020-11-25 16:58:37 -08:00
Eric Liang f6a5b733d5 Remove flaky object manager test that's no longer needed 2020-11-25 12:45:47 -08:00
Ian Rodney 679492a235 [serve] Use Long Polling in Backend Worker (#12093) 2020-11-25 12:11:38 -08:00
SangBin Cho 753cda2f28 [Dashboard] Delete old dashboard (#12144)
* Delete old dashboard from repo.

* Delete old dashboard from repo. 2
2020-11-25 11:31:02 -08:00
ZhuSenlin dc55f6ba3a skip gcs fault tolerance test for the time being when new scheduler is enabled (#12393)
Co-authored-by: senlin.zsl <senlin.zsl@antfin.com>
2020-11-25 10:40:47 -08:00
SangBin Cho 2e4e285ef0 [Object Spilling] Fusion small objects (#12087) 2020-11-25 10:13:32 -08:00
Ian Rodney c5845c3a4e [docker] Docker stop on each node (#12357) 2020-11-24 23:15:53 -08:00
Barak Michener 4066056a0d [ray_client] Support calling functions from other functions and correct the tests (#12141)
* Add test mode and fix f calling g

* formatting

* remove unused functions

* fix tests -- which will be better in actor PR
2020-11-24 22:19:20 -08:00
Tao Wang e1075c0a82 [GCS]Fill resource fields when re-report heartbeat after gcs restarted (#12097) 2020-11-25 11:07:02 +08:00
Edward Oakes dae137b919 Don't allow 'optional' files in setup.py (#12359) 2020-11-24 17:41:58 -06:00
Eric Liang 5895554555 [autoscaler] Raise node "start" deadline to 900s, make configurable (#12316) 2020-11-24 12:16:01 -08:00
Edward Oakes 4ada3e4c99 [serve] Incremental change towards async control loop for replica startup (#12281) 2020-11-24 13:06:08 -06:00
roireshef 888357d251 added address resolution fix for running in docker containers (#11944)
* added address resolution fix for running in docker containers

* added address resolution fix for running in docker containers (java)

* Update RayNativeRuntime.java

Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-11-24 10:34:56 -08:00
Edward Oakes be0fa7b8b4 Properly specify kubectl-rsync.sh in setup.py (#12356) 2020-11-24 12:13:29 -06:00
Gekho457 e66ddab190 [autoscaler/k8s] Handle unavailable k8s API (#12283) 2020-11-24 11:13:15 -06:00
Kai Fricke b94bfdfa99 [tune] use default anonymous metric _metric if at least a mode is set (#12159)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-23 20:09:33 -08:00
Edward Oakes 135f2e0602 [serve] Only use typed BackendConfig in controller (#12275) 2020-11-23 20:56:04 -06:00
Edward Oakes be2187f942 Remove use_legacy_config tests (#12272) 2020-11-23 20:30:50 -06:00
Edward Oakes 1b3c48b922 Remove unused _post_get_hooks (#12270) 2020-11-23 20:30:30 -06:00
Edward Oakes a93ed4583d Remove unused print_failed_task (#12267) 2020-11-23 16:33:10 -06:00
Richard Liaw e59fe65d3d [tune] Fix logging for dockersyncer (#12196) 2020-11-23 14:29:41 -08:00
Edward Oakes 4e43650851 Remove deprecated f._submit API (#12268) 2020-11-23 16:20:22 -06:00
Sumanth Ratna 6346ea0be6 [tune] Allow None values in TBX logger (#12262) 2020-11-23 12:57:23 -08:00
viotemp1 4c4f189f97 [tune] nevergrad add points_to_evaluate (#12207) 2020-11-23 12:51:04 -08:00
Ian Rodney 1cf95cb081 [Serve] Remove old report-queue-code (#12211) 2020-11-23 12:42:05 -08:00
Ian Rodney 7f1f16d99e [Core] Force get_actor(name)'s name to be non-empty string (#12218) 2020-11-23 12:38:16 -08:00
Richard Liaw 40428c9b05 [tune] Make tf distributed testing smaller (#12173) 2020-11-23 12:15:10 -08:00
Edward Oakes c99c376d66 Remove unused logging job id field (#12266) 2020-11-23 13:00:15 -06:00
Kaushik B 643c07e451 [tune] update param & install hints for Search Algos (#12242)
* fix(tune): change objective_name param to metric for AxSearch

* fix(tune): install hints for search algos
2020-11-22 15:39:41 -08:00
SangBin Cho eb89c567e2 [Metrics] Skip prom file test on Windows. (#12234) 2020-11-22 00:32:18 -08:00
Ameer Haj Ali 2a9499a1bd [autoscaler] cover autofilling resources with try except. (#12222)
Co-authored-by: Ameer Haj Ali <ameerhajali@ameers-mbp.lan>
Co-authored-by: Alex Wu <alex@anyscale.io>
Co-authored-by: Alex Wu <itswu.alex@gmail.com>
Co-authored-by: Eric Liang <ekhliang@gmail.com>
Co-authored-by: Ameer Haj Ali <ameerhajali@Ameers-MacBook-Pro.local>
2020-11-21 13:04:24 -08:00
Kai Fricke 608d0378c4 [tune] Add test for infinite trials (#12156) 2020-11-21 12:54:01 -08:00
Eric Liang 839517743d Support ray.* in remote functions for Ray client (#12177) 2020-11-20 13:28:46 -08:00
Richard Liaw 48042be8bb [tune] Avoid dependency on Kubernetes (#12188)
* fix-kubernetes

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* kub

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-20 13:01:20 -08:00
Simon Mo d200f620ee Deflake test_router (#12175) 2020-11-19 18:37:46 -08:00
dHannasch 4b2c5daf45 State which IP addresses are failing to match. (#11957)
* State which IP addresses are failing to match.

* Use f-string.

* action item?

* I could swear swear this passed with length 80 before

* wait, this is how it wants f-strings

* reword

* action item

* f

Co-authored-by: SangBin Cho <rkooo567@gmail.com>

* f

Co-authored-by: SangBin Cho <rkooo567@gmail.com>

* f

Co-authored-by: SangBin Cho <rkooo567@gmail.com>

Co-authored-by: SangBin Cho <rkooo567@gmail.com>
2020-11-19 17:25:25 -08:00