Commit Graph

  • 3fd3cb96ed [Utils] Add Queue async and batch methods (#12578) architkulkarni 2020-12-10 08:49:18 -08:00
  • 38ba238606 [serve] Create FutureResults from ControllerAPI (#12577) Ian Rodney 2020-12-10 08:44:08 -08:00
  • deb33bce84 [RLlib] Add DQN SoftQ learning test case. (#12712) Sven Mika 2020-12-10 14:55:19 +01:00
  • e3b5deb741 [Multi-tenancy] Delete flag enable_multi_tenancy and remove old code path (#10573) Kai Yang 2020-12-10 19:01:40 +08:00
  • d681991773 Add Discourse to readme and make it more prominent in docs. (#12740) Robert Nishihara 2020-12-10 01:13:40 -08:00
  • cf30630d2e [docker] Use legacy resolver (#12741) Ian Rodney 2020-12-10 01:12:46 -08:00
  • b07e5b9a12 increase numpy version for py39 acxz 2020-12-09 23:03:01 -05:00
  • 903a2066a7 Add support for Python 3.9 acxz 2020-12-09 23:00:36 -05:00
  • 2f8e308444 [autoscaler] LoadMetrics missed logger.debug (#12714) Ameer Haj Ali 2020-12-10 03:19:36 +02:00
  • a9da4f3201 [docker] Make Ray-ml more compatible (#12574) Ian Rodney 2020-12-09 16:03:39 -08:00
  • a776209aec Revert "Fix dashboard agent check ppid is raylet pid (#12256)" (#12729) Stephanie Wang 2020-12-09 17:20:38 -05:00
  • d455cae036 Add period to error message. (#12716) dHannasch 2020-12-09 14:58:21 -07:00
  • 974570b4fb oops (#12728) Richard Liaw 2020-12-09 13:38:10 -08:00
  • ee012532fb [core] Use node manager client pool for GCS service #10398 (#12368) Keqiu Hu 2020-12-09 12:44:40 -08:00
  • 8b9197ea8c [Doc] replace github discussion link with discourse (#12684) architkulkarni 2020-12-09 12:43:45 -08:00
  • c9873cdbc3 [Serve] Remove unused assign_request wrapper (#12721) Edward Oakes 2020-12-09 14:22:43 -06:00
  • 0b6e44efb8 [New scheduler] Cluster Resource Scheduler dynamic resources (for placement groups) (#12518) Alex Wu 2020-12-09 12:05:31 -08:00
  • ef9ebbc636 [GCS]GCS based Actor Scheduling support actor colocation (#12707) fangfengbin 2020-12-10 03:54:23 +08:00
  • ea25482f6a WIP. (#12706) Sven Mika 2020-12-09 20:49:21 +01:00
  • 19542c5eb0 [docker] Default to ray-ml image (#12703) Ian Rodney 2020-12-09 11:49:16 -08:00
  • 6f3aacd087 [serve] Clarify conda env docs (#12679) architkulkarni 2020-12-09 11:35:48 -08:00
  • f6241302a8 [RLlib] Fix issue 12678: MultiAgentBatch has no attribute total. (#12704) Sven Mika 2020-12-09 16:41:13 +01:00
  • 3ce9286977 Fix dashboard agent check ppid is raylet pid (#12256) fyrestone 2020-12-09 22:12:34 +08:00
  • 840de49161 Fix race condition between failure detection and references going out of scope (#12573) Stephanie Wang 2020-12-09 02:49:55 -05:00
  • 28108c905b [RLlib] Tf-eager policy bug fix: Duplicate model call in compute_gradients. (#12682) Sven Mika 2020-12-09 08:03:58 +01:00
  • cab46b7931 Improve issue templates (#12687) Eric Liang 2020-12-08 22:29:03 -08:00
  • bd7e26b768 [Autoscaler] Temporarily suppress "Removed stale ip mappings" message. (#12689) Alex Wu 2020-12-08 21:55:10 -08:00
  • dc4b5c7aa3 [ray_client] Passing actors to actors (#12585) Barak Michener 2020-12-08 21:54:55 -08:00
  • d534719af6 temporary-fix (#12700) Richard Liaw 2020-12-08 21:48:26 -08:00
  • a4dbb271bd [hotfix][autoscaler] Request resources refactor2 (#12661) Ameer Haj Ali 2020-12-09 04:41:30 +02:00
  • 343b479ae2 [TEST] Fix Ray windows build for debugger (#12671) Philipp Moritz 2020-12-08 18:12:48 -08:00
  • e40b14d255 [RLlib] Batch-size for truncate_episode batch_mode should be confgurable in agent-steps (rather than env-steps), if needed. (#12420) Sven Mika 2020-12-09 01:41:45 +01:00
  • fd4e025da6 [serve] Add docs on configuring cv2 parallelism (#12652) Edward Oakes 2020-12-08 16:03:13 -06:00
  • 50f28811ac [new scheduler] Always spill back to a feasible node if the local node is not feasible (#12557) Stephanie Wang 2020-12-08 13:46:58 -05:00
  • b7404e7955 [dashboard] Resolve npm vulnerabilities (#12620) Sumanth Ratna 2020-12-08 13:26:49 -05:00
  • df10b84113 [Release] release tests yamls for Tune & GPU (#12496) Kai Fricke 2020-12-08 19:15:07 +01:00
  • f61bc79a87 Dmitri/k8s command runner home try again (#12609) Gekho457 2020-12-08 09:44:22 -08:00
  • 2a9079aef9 [grpc]'ray memory' fails if there are many objects in scope #8502 (#12673) Keqiu Hu 2020-12-08 09:36:53 -08:00
  • 4c0f0ce3a9 [RLlib] In OffPolicyEstimators (Offline RL): Include last step of trajectory (#12619) Felipe Antunes 2020-12-08 08:39:40 -03:00
  • f27ceecbf6 [doc] update lint script location (#12670) Keqiu Hu 2020-12-07 22:26:42 -08:00
  • 162f361dab [Logging] Fix log monitor issue (#12588) SangBin Cho 2020-12-07 22:01:18 -08:00
  • cc2f43c826 [Dashboard][Bugfix] Fix bug in display of worker logs and errors in Dashboard (#12660) Max Fitton 2020-12-07 21:41:13 -08:00
  • 34b9c7449b [Dashboard] Fix object store memory display. (#12664) Max Fitton 2020-12-07 21:40:49 -08:00
  • 93c0eb249c [PlacementGroup]Support acquire and return bundle resource from gcs resource manager (#12349) fangfengbin 2020-12-08 10:29:57 +08:00
  • b1f2b142d5 [Core] Ensure global state is connected when exception hook is called from the driver. (#12655) SangBin Cho 2020-12-07 18:28:32 -08:00
  • 040cf2c13b [Doc] Placement group doc small update (#12594) SangBin Cho 2020-12-07 13:58:27 -08:00
  • 3ee4612696 [Release] Fix cluster.yaml (#12589) SangBin Cho 2020-12-07 13:52:30 -08:00
  • 340b1e99fc [RLlib] Fix JAX import bug. (#12621) Sven Mika 2020-12-07 20:05:08 +01:00
  • 7e1422e925 [PlacementGroup]Fix placement group strict spread bug when node dead (#12647) fangfengbin 2020-12-07 21:50:28 +08:00
  • 99c81c6795 [RLlib] Attention Net prep PR #3. (#12450) Sven Mika 2020-12-07 13:08:17 +01:00
  • 401d342602 [PlacementGroup]Add PlacementGroup wait python api (#12601) fangfengbin 2020-12-07 13:53:49 +08:00
  • 73a1a232b9 Ray debugger stepping between tasks (#12075) Philipp Moritz 2020-12-06 21:50:18 -08:00
  • 260b07cf0c [PlacementGroup]Add PlacementGroup wait java api (#12499) fangfengbin 2020-12-05 16:40:04 +08:00
  • 1c0d10f67e [tune] Add xgboost_ray integration (#12572) Kai Fricke 2020-12-04 22:59:20 +01:00
  • 219c445648 [tune] verbosity refactor second attempt (#12571) Kai Fricke 2020-12-04 22:56:26 +01:00
  • 7cad648370 [SGD] Fixes TorchTrainer scales up (#12563) Xianyang Liu 2020-12-05 05:55:15 +08:00
  • f965537ae9 [tune] Callable accepted for register_env (#12618) Marci 2020-12-04 21:21:25 +01:00
  • 0138c2dbb4 [Metrics] Remove redundant unit specification. (#12595) SangBin Cho 2020-12-04 00:06:21 -08:00
  • 21fcee28f9 [Java] Simplify Ray.init() by invoking ray start internally (#10762) Kai Yang 2020-12-04 14:33:45 +08:00
  • 8cebe1e79c [autoscaler] Fix worker capping fifo test in new scheduler (#12512) Eric Liang 2020-12-03 17:21:35 -08:00
  • 515f67034a [tune] debug py37 build (#12597) Richard Liaw 2020-12-03 13:47:54 -08:00
  • 1ce5e0e99f [tune] Fix file descriptor leak by syncer (#12590) Richard Liaw 2020-12-03 13:39:04 -08:00
  • 36e46ed923 Revert "[autoscaler/k8s] Use ray node's HOME in Kubernetes command runner. (#12417)" (#12607) Eric Liang 2020-12-03 12:57:59 -08:00
  • 1f7a4806ff [Serve] Fix Flask Request self reference (#12560) Simon Mo 2020-12-03 08:45:04 -08:00
  • f669830de6 [autoscaler/k8s] Use ray node's HOME in Kubernetes command runner. (#12417) Gekho457 2020-12-03 08:43:16 -08:00
  • 3f4bc16276 [RLlib] Add a minimal JAX ModelV2 (FCNet) to RLlib. (#12502) Sven Mika 2020-12-03 15:51:30 +01:00
  • ff34563539 [PlacementGroup]Fix bug that kill workers mistakenly when gcs restarts (#12568) fangfengbin 2020-12-03 17:50:48 +08:00
  • 7c58a85fed [tune] fix Tensorboard file descriptor leak (#12425) Richard Liaw 2020-12-03 00:06:54 -08:00
  • 62fbe63f34 Disable flaky test test_delete_objects_multi_node (#12584) Eric Liang 2020-12-02 19:19:12 -08:00
  • 8058c1eb54 [serve] Add option to not start HTTP servers (#11627) Edward Oakes 2020-12-02 16:49:34 -06:00
  • a5c846c83b [Dashboard][Bugfix] Filter dead nodes from Machine View (fixes duplicate node issue) (#12579) Max Fitton 2020-12-02 14:08:14 -08:00
  • 2ec7b7367e [doc] update contributing doc (#12564) Keqiu Hu 2020-12-02 12:08:30 -08:00
  • 7422abddb4 [tune] trim kwargs in shim instantiation functions (#12544) Kaushik B 2020-12-03 01:37:00 +05:30
  • da42bf29d0 [tune] horovod release test (#12495) Richard Liaw 2020-12-02 12:04:54 -08:00
  • 443339ab19 [core] Move out-of-memory handling into the plasma store and support async object creation (#12186) Stephanie Wang 2020-12-02 13:25:54 -05:00
  • 786f839ff3 [Windows] Fix windows build (#12555) Ian Rodney 2020-12-02 09:37:40 -08:00
  • 0a12eba603 Revert "Fix race condition between failure detection and references going out of scope (#12548)" (#12570) Kai Fricke 2020-12-02 16:20:17 +01:00
  • a21523c709 [tune/core] serialization debugging utility (#12142) Richard Liaw 2020-12-02 00:52:17 -08:00
  • 63b85df828 [xgb] update docs (#12549) Kai Fricke 2020-12-02 08:17:23 +01:00
  • e428134137 [Hotfix] Pin llvmlite for windows build (#12559) Simon Mo 2020-12-01 19:43:08 -08:00
  • 615f974313 Add context for "test_buffer_alignment" (#12519) Siyuan (Ryans) Zhuang 2020-12-01 19:27:14 -08:00
  • 8801e87afd Fix race condition between failure detection and references going out of scope (#12548) Stephanie Wang 2020-12-01 20:52:30 -05:00
  • 19c8033df2 [RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366) Sven Mika 2020-12-02 02:41:10 +01:00
  • 4dc16730a7 [tune] with-params fix (#12522) Richard Liaw 2020-12-01 16:47:03 -08:00
  • 7022278ce9 Deflake Serve tests (#12542) Simon Mo 2020-12-01 13:42:21 -08:00
  • 4288b5b9ff [placement group] -1 option for placement group index (#12532) Ameer Haj Ali 2020-12-01 23:16:18 +02:00
  • 981df65b91 [Doc] Improve the placement group document (#12507) SangBin Cho 2020-12-01 13:15:30 -08:00
  • 6412dfaf38 [ray_client] actors v0 (#12388) Barak Michener 2020-12-01 13:12:08 -08:00
  • 0e892908f7 [Object Spilling] Delete spilled objects when references are gone out of scope. (#12341) SangBin Cho 2020-12-01 13:10:39 -08:00
  • ef1b0c13c3 Async Future Throws RayError as well (#12419) Simon Mo 2020-12-01 13:07:43 -08:00
  • bdf8ad3b5a fix (#12528) Richard Liaw 2020-12-01 09:58:12 -08:00
  • f596113fc7 [Core] Actor Retries Out of Order Tasks on Restart (#12338) Simon Mo 2020-12-01 09:35:54 -08:00
  • f6f3cc9af1 [Core]Remove checkpoint table (#12235) SangBin Cho 2020-12-01 08:58:36 -08:00
  • 9021f15b2a [RLlib] Fix setup-dev.py error when creating a softlink for new_dashboard. (#12442) Sven Mika 2020-12-01 11:46:59 +01:00
  • 3ad9365e1d [RLlib] Attention Net prep PR #2: Smaller cleanups. (#12449) Sven Mika 2020-12-01 08:21:45 +01:00
  • e72147de38 Fix Serve typo (#12524) Edward Oakes 2020-12-01 01:15:42 -06:00
  • fd8ae0697b [autoscaler] Fix test heartbeats single test (#12513) Eric Liang 2020-11-30 21:24:45 -08:00
  • 16ca748454 [CI] Use legacy resolver for some pip imports (#12517) Amog Kamsetty 2020-11-30 21:18:21 -08:00
  • f9a99f20dd Revert "Re-Revert "[Core] zero-copy serializer for pytorch (#12344)" (#12478)" (#12515) Amog Kamsetty 2020-11-30 19:05:55 -08:00
  • 8223a33bff [Logging] Log rotation on all components (#12101) SangBin Cho 2020-11-30 19:03:55 -08:00