Commit Graph

  • 6aba4ab8f9 Disable validation of cluster config on the cluster to allow for cluster configs with new properties. (#11693) Alan Guo 2020-10-30 14:02:00 -07:00
  • cce91b51bd [docker] Fix docker regex (#11726) Alex Wu 2020-11-02 11:23:06 -08:00
  • 171e02c684 [serve] re-enable serve-controller-crash test (#11579) Ian Rodney 2020-11-02 11:22:09 -08:00
  • 4a7d0e059d [GCS]Optimize subscription perf (#11669) fangfengbin 2020-11-03 01:46:04 +08:00
  • 8346dedc3a Fix the linter failure. (#11755) dHannasch 2020-11-02 10:02:15 -07:00
  • 26176ec570 [RLlib] Fix epsilon_greedy on nested_action_spaces only in pytorch (#11453) bcahlit 2020-11-02 19:22:33 +08:00
  • 54d85a6c2a [RLlib] Fix RNN learning for tf-eager/tf2.x. (#11720) Sven Mika 2020-11-02 11:18:41 +01:00
  • bfc4f95e01 [RLlib] Fix test_bc.py test case. (#11722) Sven Mika 2020-10-31 08:16:09 +01:00
  • 7fbf938c3f Version bump 1.0.1 Alex Wu 2020-10-30 20:23:59 -07:00
  • f137808518 [tune] PB2 (#11466) Jack Parker-Holder 2020-10-27 08:03:21 +00:00
  • 1f1ea85a5c Fix asyncio plasma integration in cluster mode (#11665) Simon Mo 2020-10-29 11:53:10 -07:00
  • 44a379ee9b [tune] fixed validation for search metrics (#11583) Raoul Khouri 2020-10-23 20:04:21 -04:00
  • af5252901a [release] Do not tag docker latest on release builds (#11694) Alex Wu 2020-10-29 23:13:25 -07:00
  • e8c4f9e776 [releng]: Quiet Docker Push (and explain why) (#11623) Barak Michener 2020-10-29 00:18:51 -07:00
  • 48dee789b3 Add random actor placement; fix cancellation callback; update test skips (#11684) Eric Liang 2020-10-30 18:36:35 -07:00
  • b10871a1f5 [Core]Fix get workrer table bug (#11516) DK.Pino 2020-10-31 05:48:29 +08:00
  • 71c5089854 [Object Spilling] Initial Iteration of S3 adapter. (#11379) SangBin Cho 2020-10-30 14:47:07 -07:00
  • 7aade469d0 [autoscaler] fix the autoscaling bug for continuously launching failed nodes (#11714) Ameer Haj Ali 2020-10-30 23:12:06 +02:00
  • 8816d34541 Kubernetes rsync verbosity fixed (#11716) Gekho457 2020-10-30 17:03:42 -04:00
  • 3c109b45aa Disable validation of cluster config on the cluster to allow for cluster configs with new properties. (#11693) Alan Guo 2020-10-30 14:02:00 -07:00
  • f9f372c327 [autoscaler] Clean up monitoring loop code (#11677) Eric Liang 2020-10-30 13:48:43 -07:00
  • 6e2a1eac36 [Placement Group] Placement group automatic cleanup. (#11546) SangBin Cho 2020-10-30 10:55:43 -07:00
  • 5a83d8918a [release] Do not tag docker latest on release builds (#11694) Alex Wu 2020-10-29 23:13:25 -07:00
  • b4df42b027 [Dashboard] Make Infeasible Actor UX Less Scary (#11654) Max Fitton 2020-10-29 23:12:43 -07:00
  • d6628cdbfb [Dashboard] Fix null gpu utilization (#11650) Max Fitton 2020-10-29 23:11:50 -07:00
  • e022d12dc3 [New scheduler] Deflake test heartbeat (#11586) Alex Wu 2020-10-29 23:10:19 -07:00
  • 4175569d96 [Core] Add option to override environment variables for tasks and actors (#11619) architkulkarni 2020-10-29 12:22:44 -07:00
  • e82ff08b0c Fix asyncio plasma integration in cluster mode (#11665) Simon Mo 2020-10-29 11:53:10 -07:00
  • 0b7a3d9e02 [Log] new spdlog tool for ray (#10967) Lingxuan Zuo 2020-10-30 02:37:13 +08:00
  • 87e971bff0 [docker] Include python k8s package in ray-deps (#11703) Ian Rodney 2020-10-29 10:57:23 -07:00
  • 6999db93cb Un-indent multiagent section (#11310) Yutai Zhou 2020-10-29 11:12:48 -04:00
  • 0b07af374a allow tuple action space (#11429) Jiajie Xiao 2020-10-29 08:05:38 -07:00
  • 91fa7e0b4e [releng]: Quiet Docker Push (and explain why) (#11623) Barak Michener 2020-10-29 00:18:51 -07:00
  • 46afec5660 Mute asyncio warning for Serve (#11682) Simon Mo 2020-10-28 17:05:42 -07:00
  • 64e3c9741a Update rllib-algorithms.rst (#11642) huyz-git 2020-10-29 06:07:10 +08:00
  • 9e68b77796 [RLLIB] Wait for remote_workers to finish closing environments before terminating (#11476) mvindiola1 2020-10-28 17:23:06 -04:00
  • fcaf4d80e3 [serve] Make fractional resource usage more obvious in docs (#11580) Edward Oakes 2020-10-28 15:54:36 -05:00
  • ba63ded311 [tune] better error when metric or mode unset in search algorithms (#11646) Kai Fricke 2020-10-28 21:17:59 +01:00
  • 58891551d3 [tune] make tests faster + fix flaky test (#10264) Richard Liaw 2020-10-28 13:14:54 -07:00
  • 9e63f7ccc3 [autoscaler/k8s] ray up 409 error fix (#11660) Gekho457 2020-10-28 15:19:57 -04:00
  • 1d5694ddea [GCS]Use direct getting instead of pub-sub to update load metrics in monitor.py (#11339) Tao Wang 2020-10-29 02:23:18 +08:00
  • c933477915 [new scheduler] Pass test_basic and add CI builds with flag on (#11635) Eric Liang 2020-10-28 11:02:43 -07:00
  • 427b5af0ae [Object spilling] Refactor raylet to add a local object manager class (#11647) Stephanie Wang 2020-10-28 10:38:42 -04:00
  • 70ea1fbe30 [sgd] pin ptl to 1.0.3 (#11664) Richard Liaw 2020-10-28 00:29:01 -07:00
  • 05ad4c7499 [Dashboard] Optimize dashboard datacenter (#11391) fyrestone 2020-10-28 14:49:31 +08:00
  • 55a090fb16 [GCS]Optimize gcs client nodes get function (#11424) fangfengbin 2020-10-28 12:13:19 +08:00
  • c3e246818a [Core] Fix doc string for ray.init() (#11657) yncxcw 2020-10-27 19:27:22 -06:00
  • 273a712786 [GCS]Decouple node failure detector with resoure related operations (#11465) Tao Wang 2020-10-28 06:52:42 +08:00
  • 1c40950877 [autoscaler] Add the cluster_name to docker file mounts directory prefix to make it more unique (#11600) Ameer Haj Ali 2020-10-28 00:33:11 +02:00
  • c4ae94d60b [autoscaler] Azure deployment fixes (#11613) Scott Graham 2020-10-27 18:27:18 -04:00
  • 293483ed0b [k8s][minor] fix error handling (#11653) Richard Liaw 2020-10-27 15:24:07 -07:00
  • 3ce852d345 [docker] Synchronize Torch for Tune & RLlib (#11637) Ian Rodney 2020-10-27 10:37:25 -07:00
  • ebe9a8865c [GCS]Fix a bug that creates invalid connection (#11590) fangfengbin 2020-10-28 01:08:06 +08:00
  • d9f1874e34 [RLlib] Minor fixes (torch GPU bugs + some cleanup). (#11609) Sven Mika 2020-10-27 10:00:24 +01:00
  • e7aafd7d24 [tune] PB2 (#11466) Jack Parker-Holder 2020-10-27 08:03:21 +00:00
  • 349c3ec86b Remove errant "self" argument to NodeProvider static method Edward Oakes 2020-10-27 00:22:41 -05:00
  • fe4a78b7c7 [Hotfix] Pin Pydantic Version (#11622) Simon Mo 2020-10-26 16:52:19 -07:00
  • 1a1ff28d18 [tune] allow tune search spaces to be passed to search algorithms (#11503) Kai Fricke 2020-10-26 19:33:13 +00:00
  • 4ad8af9b0d [tune] More PTL example cleanup (#11585) Richard Liaw 2020-10-26 12:26:14 -07:00
  • b02e61f672 [minor] fix up docs (#11596) Richard Liaw 2020-10-26 12:19:03 -07:00
  • 2da6ad2176 [core] Better error message for named actor not found (#11604) Ian Rodney 2020-10-26 09:46:02 -07:00
  • 0fbee4da0c [GCS] Remove unused ReportBatchHeartbeat/SubscribeHeartbeat (#11567) Tao Wang 2020-10-26 12:06:28 +08:00
  • 11f1bbf03c [tune] use isinstance instead of type for TBXLogger (#11595) Sumanth Ratna 2020-10-25 19:12:44 -04:00
  • 1b357533b1 [tune] Try to enable PTL, SKlearn tests (#11542) Richard Liaw 2020-10-24 01:08:46 -07:00
  • d3ee83205b Remove crashing assert in actor creation for old scheduler (#11577) Eric Liang 2020-10-24 00:05:26 -07:00
  • 5ad5cb61ca Remove outdated numpy serializer (#11587) Siyuan (Ryans) Zhuang 2020-10-23 22:58:05 -07:00
  • c3c72db69b [tune] fixed validation for search metrics (#11583) Raoul Khouri 2020-10-23 20:04:21 -04:00
  • 0979589c7c [dask-on-ray] Convert tuple of object refs to list before ray.get() call. (#11582) Clark Zinzow 2020-10-23 17:39:22 -06:00
  • 84fc622659 [yaml] HotFix for correct example full (#11584) Ian Rodney 2020-10-23 13:55:07 -07:00
  • 395ddb093c [tune] a tiny ptl example (#11497) Richard Liaw 2020-10-22 18:50:34 -07:00
  • 2d9b7355ba Clean up release tests (#11420) Barak Michener 2020-10-22 17:04:41 -07:00
  • 1034083988 [RaySGD] Docs for SGD+Tune usage (#11479) Amog Kamsetty 2020-10-22 13:32:27 -07:00
  • c26dbc1612 [Autoscaler] Do not count unmanaged nodes in load metrics (#11458) Alex Wu 2020-10-21 22:14:21 -07:00
  • a2e12ceb2a [Core] Allow creating tasks/actors in a detached actor when driver has exited (#11493) Kai Yang 2020-10-22 01:45:29 +08:00
  • d3405e74da [autoscaler] SDK fixes (#11517) Ian Rodney 2020-10-23 14:09:47 -07:00
  • aef96d17bf [yaml] HotFix for correct example full (#11584) Ian Rodney 2020-10-23 13:55:07 -07:00
  • caf3b04b27 [Dashboard] Turn on new dashboard by default pt 2 (#11510) Max Fitton 2020-10-23 16:52:14 -04:00
  • 8ee4f7eca3 [tune] fix pbt ptl example (#11573) Kai Fricke 2020-10-23 20:42:13 +01:00
  • 7a0184e081 [docker] Push to DockerHub in CI (#11442) Ian Rodney 2020-10-23 12:02:15 -07:00
  • 1ce0c4965b [Serve] Update front page of serve doc (#11421) architkulkarni 2020-10-23 12:01:04 -07:00
  • 9f804ade5f [Placement Group]Add get all placement group api (#11460) DK.Pino 2020-10-24 02:46:48 +08:00
  • e7aa6441b7 [tune] a tiny ptl example (#11497) Richard Liaw 2020-10-22 18:50:34 -07:00
  • 4348ecf850 Clean up release tests (#11420) Barak Michener 2020-10-22 17:04:41 -07:00
  • 2d1f52c21c [autoscaler] Removed .cleanup() from NodeProvider and commands.py (#11543) Gekho457 2020-10-22 17:46:49 -04:00
  • 47531ac7e6 Resolve Issue #11556 by changing the docs to reference _temp_dir. (#11562) dHannasch 2020-10-22 15:24:46 -06:00
  • 73fa94731f [tune] Add HDFS as Cloud Sync Client (#11524) Frank Gu 2020-10-22 14:12:51 -07:00
  • 083737c63c Deprecate rsync to all nodes (#11563) Eric Liang 2020-10-22 13:45:42 -07:00
  • d87c186721 [RaySGD] Docs for SGD+Tune usage (#11479) Amog Kamsetty 2020-10-22 13:32:27 -07:00
  • d1dd5d578e [RLlib] Fix PyTorch A3C / A2C loss function using mixed reduced sum / mean (#11449) Kingsley Kuan 2020-10-23 03:39:34 +08:00
  • cf2ee94e0c [Autoscaler] Allow users to set the names for security groups created by ray (#11405) Allen 2020-10-22 12:28:59 -07:00
  • 7111a424af [Serve] Add regression test for #11437 (#11539) Simon Mo 2020-10-22 10:45:18 -07:00
  • d1182b827a [Autoscaler] Do not count unmanaged nodes in load metrics (#11458) Alex Wu 2020-10-21 22:14:21 -07:00
  • cac4c82c8a [hotfix] Pin node version (fix linux wheel build) (#11532) Max Fitton 2020-10-21 22:10:09 -04:00
  • dbcb368dea Add --worker-port-list option to ray start (#11481) Edward Oakes 2020-10-21 14:46:45 -05:00
  • 9d765ba740 [autoscaler] Add rsync_exclude and rsync_filter options to cluster config (#11512) Alan Guo 2020-10-21 14:28:33 -07:00
  • cf1c737895 [autoscaler/AWS] Updated AWS Node Provider threading logic (#11422) Gekho457 2020-10-21 21:42:38 -04:00
  • 44fb60b4dd [hotfix] Pin node version (fix linux wheel build) (#11532) Max Fitton 2020-10-21 22:10:09 -04:00
  • af0fde4efd [hotfix] disable sklearn again (#11541) Richard Liaw 2020-10-21 19:04:48 -07:00
  • 155687e0c3 [autoscaler/AWS] Updated AWS Node Provider threading logic (#11422) Gekho457 2020-10-21 21:42:38 -04:00
  • ede9347127 [rllib] Add torch_distributed_backend flag for DDPPO (#11362) (#11425) Philsik Chang 2020-10-22 10:30:42 +09:00