Commit Graph

  • 84b689f32f remove debug statement Akash Patel 2020-12-17 09:53:32 -05:00
  • e6cb4f4bd7 [Core] Add log of address and port (#12908) Allen 2020-12-17 00:25:29 -08:00
  • 40032541dc [core] Introduce fetch_local to ray.wait (#12526) Yi Cheng 2020-12-16 23:44:28 -08:00
  • 12231ec2a6 Optimize heartbeat manager initialization (#12911) Tao Wang 2020-12-17 14:24:23 +08:00
  • 020ad98f6f install setproctitle from pypi instead of building from source acxz 2020-12-17 00:36:12 -05:00
  • 057687e534 [New Scheduler] Fix test_failure.py by supporting infeasible tasks (#12738) SangBin Cho 2020-12-16 21:27:50 -08:00
  • c8d14eb3c5 update setproctitle to use with py39 acxz 2020-12-16 22:42:31 -05:00
  • ad036fd564 Fix continue for debugger (#12862) Philipp Moritz 2020-12-16 16:09:13 -08:00
  • dd522a71a1 [SGD] Disable Elastic Training by default when using with Tune (#12927) Amog Kamsetty 2020-12-16 15:37:44 -08:00
  • 8b783ecafa Fix pull manager retry (#12907) Alex Wu 2020-12-16 14:18:43 -08:00
  • c677b9e201 [autoscaler] Fix flaky autoscaler test (#12918) Ameer Haj Ali 2020-12-17 00:18:27 +02:00
  • aedcf0c9d9 Disable test_distributions (#12919) Edward Oakes 2020-12-16 16:17:49 -06:00
  • fdb4c6eb1c Better message for too little /dev/shm memory (#12896) Edward Oakes 2020-12-16 10:30:20 -06:00
  • 2b38938305 remove extra newline acxz 2020-12-16 11:06:33 -05:00
  • 7d8a008aeb Merge branch 'master' into py39 Akash Patel 2020-12-16 11:04:27 -05:00
  • 91878d18b5 [PlacementGroup]Fix placement group wait api disorder bug (#12827) fangfengbin 2020-12-16 18:45:53 +08:00
  • 7ff314a5df [New scheduler] Also unsubscribe get dependencies on unblock Eric Liang 2020-12-15 20:29:44 -08:00
  • a7caa14d3d [k8s] avoid bad error messages (#12871) Richard Liaw 2020-12-15 15:00:02 -08:00
  • f4b5a8b2f7 [serve] Re-enable test_failure.py (#12891) Edward Oakes 2020-12-15 16:02:04 -06:00
  • 87cf1a97e5 [core] recover startup logs (#12876) Richard Liaw 2020-12-15 13:49:45 -08:00
  • 6795d7c75c [serve] Fix flaky test_api.py::test_backend_user_config (#12892) Edward Oakes 2020-12-15 15:35:30 -06:00
  • ea1228074d [tune] enable points_to_eval for all search algorithms (#12790) Kai Fricke 2020-12-15 20:51:53 +01:00
  • fdd85e3af4 [Serve] Add benchmark for async handles (#12858) Simon Mo 2020-12-15 11:21:51 -08:00
  • 0031723ace [New scheduler] Object spilling (#12857) Alex Wu 2020-12-15 11:05:38 -08:00
  • cde711aaf1 Revert "[RLLib] Execution-Folder Type Annotations (#12760)" (#12886) Edward Oakes 2020-12-15 13:03:02 -06:00
  • ba12fb1451 Fix for RLIMIT patch (#12882) architkulkarni 2020-12-15 10:38:46 -08:00
  • de7848231c [Doc] Fix placement group doc (#12875) SangBin Cho 2020-12-15 10:36:51 -08:00
  • 261b2f9053 Check for raylet PID as ppid in dashboard agent fate-sharing (#12867) Edward Oakes 2020-12-15 12:13:11 -06:00
  • e077bc4206 [Release] Bump master to 1.2.0 for 1.1.0 release (#12856) Max Fitton 2020-12-15 09:40:26 -08:00
  • b291dd4486 [Metrics] Call GetMeasureDoubleByName to prevent override (#12860) Simon Mo 2020-12-15 09:39:39 -08:00
  • 5a142d5bd6 Use nightly images in all kubernetes examples. (#12868) Gekho457 2020-12-14 20:49:41 -08:00
  • 43b9259d40 [GCS]GCS resource manager support scheduling resource (#12780) fangfengbin 2020-12-15 10:27:55 +08:00
  • 8cebe5cbe9 [docs][autoscaler][k8s][minor] quotes #12866 Gekho457 2020-12-14 18:24:13 -08:00
  • 44f5be04ca [autoscaler][k8s][doc][minor] Fix typo in k8s doc. (#12865) Gekho457 2020-12-14 17:30:43 -08:00
  • b56db5a22f [Serve] Wait for actor name to be cleaned up (#12215) Simon Mo 2020-12-14 15:09:43 -08:00
  • 231518e86f [Serve] Support basic Starlette response types (#12811) architkulkarni 2020-12-14 15:03:56 -08:00
  • d0813c1c58 [Dashboard] Add dashboard multi-node churn test (#11768) Max Fitton 2020-12-14 15:03:33 -08:00
  • c56799e3da disable-for-now (#12838) Richard Liaw 2020-12-14 14:18:31 -08:00
  • 1eb4ac12b1 Clip RLIMIT_NOFILE increase to avoid redis failing to start on Big Sur Eric Liang 2020-12-14 14:05:19 -08:00
  • 69b0bc2132 [Logging] Use file handle temporalily (#12839) SangBin Cho 2020-12-14 11:42:44 -08:00
  • ac53e2f857 [GCS]Tell dead nodes to commit suicide (#12792) Tao Wang 2020-12-15 03:42:00 +08:00
  • becca1424d [RLLib] Execution-Folder Type Annotations (#12760) Michael Luo 2020-12-14 10:16:44 -08:00
  • 11ce1dc743 Ray cluster CRD and example CR + multi-ray-cluster operator (#12098) Gekho457 2020-12-14 08:26:01 -08:00
  • 35f7d84dbe Revert heartbeat interval to keep ci stable (#12836) Tao Wang 2020-12-14 16:58:40 +08:00
  • 22c1968d62 Runing -> Running (#12826) Eric Squires 2020-12-14 01:23:48 -05:00
  • aaa11941f6 [autoscaler] Fix flaky autoscaler test (#12829) Ameer Haj Ali 2020-12-14 03:09:30 +02:00
  • 3c808835a5 [RLlib] Issue 12831: AttributeError: 'NoneType' object has no attribute 'id' when using custom Atari env. (#12832) Sven Mika 2020-12-13 16:15:54 +01:00
  • 1e02b28abe [GCS]Move node resource info to gcs resource manager (#12775) fangfengbin 2020-12-13 20:37:34 +08:00
  • ac24d1db30 [Dashboard][Bugfix] Fix GPU List Bug (#12666) Max Fitton 2020-12-12 23:34:24 -08:00
  • 153b24746c [Placement Group] Refactor pg resource constrain in node manager (#12538) DK.Pino 2020-12-13 15:32:15 +08:00
  • bdc6624da8 Revert "[PlacementGroup]Add PlacementGroup wait python api (#12601)" (#12825) Eric Liang 2020-12-12 12:13:48 -08:00
  • b73d4831d4 Add grace period before warning of resource deadlock Eric Liang 2020-12-12 12:02:13 -08:00
  • 6eb0e6f734 [format] Improve formatting with a real .flake8 file (#12800) Barak Michener 2020-12-12 11:34:30 -08:00
  • 2f2bd884a3 [tune] upgrade gpytorch, bump default pytorch to 1.7.0 (#12776) Richard Liaw 2020-12-12 10:35:33 -08:00
  • 7e09f1d934 remove-xgboost-build (#12822) Richard Liaw 2020-12-12 10:34:56 -08:00
  • c22990a537 [GCS]GCS node manager rename GetNode to GetAliveNode (#12781) fangfengbin 2020-12-12 20:34:43 +08:00
  • 5f04ade6ef [tune] add more stoppers and stopper documentation (#12750) Kai Fricke 2020-12-12 10:47:19 +01:00
  • 905652cdd6 [tune] migrate xgboost callback api (#12745) Kai Fricke 2020-12-12 10:42:20 +01:00
  • 42c70be073 [tune] Hyperopt: Directly accept category variables instead of indices (#12715) Kai Fricke 2020-12-12 10:40:53 +01:00
  • 0b1fbc5e83 [PR 1/6] Collective in Ray (#12637) Hao Zhang 2020-12-12 04:26:36 -05:00
  • aa64cd4534 [New scheduler] Fix test_global_state (#12586) Alex Wu 2020-12-11 21:47:01 -08:00
  • 03d869d51c Hold GIL while submitting (actor) tasks (#12803) Edward Oakes 2020-12-11 21:47:16 -06:00
  • aec5c9879e Add tests for atexit handler behavior (#12808) Edward Oakes 2020-12-11 21:47:05 -06:00
  • 6262ee1f76 Clarify docs for atexit behavior when using ray.kill (#12807) Edward Oakes 2020-12-11 21:45:39 -06:00
  • bd866d926d debug sys.path for setproctitle acxz 2020-12-11 22:07:24 -05:00
  • 1ce745cf44 Add automatic local GC and plasma debug logs every 10 minutes by default (#12804) Eric Liang 2020-12-11 17:09:58 -08:00
  • 496e449a8b Switch long running test cluster yaml to ml image Max Fitton 2020-12-11 15:43:51 -08:00
  • abb1eefdc2 [RLlib] Issue 12483: Discrete observation space error: "ValueError: ('Observation ({}) outside given space ..." when doing Trainer.compute_action. (#12787) Sven Mika 2020-12-11 22:43:30 +01:00
  • 676ec363f6 [Object Manager] Pull Manager refactor (#12335) Alex Wu 2020-12-11 11:56:23 -08:00
  • 3d8c1cbae6 [Serve] Fix Serve Release Tests (#12777) Simon Mo 2020-12-11 11:53:47 -08:00
  • 4ad4463be6 Add comments to clarify purpose of new scheduler queues (#12730) Eric Liang 2020-12-11 11:53:09 -08:00
  • 9b3863b81b update long running release tests Max Fitton 2020-12-11 10:22:10 -08:00
  • 9ded69fdaa [Hotfix] Fix python client lint error (#12783) fangfengbin 2020-12-12 02:15:53 +08:00
  • 68d7fa2137 Fix exit_actor in asyncio mode (#12693) Simon Mo 2020-12-11 09:35:17 -08:00
  • 699ded5328 [serve] Initial commit for CLI (#12770) Edward Oakes 2020-12-11 10:31:29 -06:00
  • 74c98ac38e [RLlib] Issue 12244: Unable to restore multi-agent PPOTFPolicy's Model (from exported). (#12786) Sven Mika 2020-12-11 16:13:38 +01:00
  • 295b6e5ce4 Split heartbeat message (#12535) Tao Wang 2020-12-11 21:19:57 +08:00
  • 867d2a8aa3 [Streaming] Add more documents. (#12746) Lixin Wei 2020-12-11 20:36:17 +08:00
  • a082ea18b8 [RLlib] Issue 12212: "TFEagerPolicy has no attribute action_sampler_fn. Sven Mika 2020-12-11 12:57:33 +01:00
  • 86b0741026 [new scheduler] Allocate resources for spilled back task to a local view of the remote node (#12711) Stephanie Wang 2020-12-10 22:43:29 -05:00
  • b7f246c451 [ray_client] Include multiple facets of the Ray API (#12736) Barak Michener 2020-12-10 19:09:34 -08:00
  • 8d1ad25545 [docs] Add troubleshooting section to installation page (#12659) Sumanth Ratna 2020-12-10 21:56:56 -05:00
  • 9b3ef2f340 [docs] Fix Docker links (#12702) Ian Rodney 2020-12-10 18:08:48 -08:00
  • 62d6b0a558 Fix max_task_retries for named actors (#12762) Edward Oakes 2020-12-10 18:24:55 -06:00
  • 0e90cbcd19 Remove unused ci/performance_tests (#12767) Edward Oakes 2020-12-10 18:23:16 -06:00
  • 6532e30402 hard code link to release candidate wheel in release tests Max Fitton 2020-12-10 16:17:49 -08:00
  • c7b6ec88ef [serve] Make serve __del__ log DEBUG level (#12766) Edward Oakes 2020-12-10 18:14:55 -06:00
  • 3c44c0d3e4 [serve] Long polling for routes in http server (#12724) Edward Oakes 2020-12-10 18:02:02 -06:00
  • 006856b9a1 fix gpu base image name in build-docker.sh script (#12642) Lee moon soo 2020-12-10 14:31:59 -08:00
  • 932837eb4c [streaming] Remove unused imports in streaming CI tests (#12722) Sumanth Ratna 2020-12-10 17:27:06 -05:00
  • 2e084959a1 Fix a wrong import in test_performance.py (#12734) Ruoyun Huang 2020-12-10 14:26:21 -08:00
  • 231ecffa3d add tags.lock and tags.temp to .gitignore (#12752) Eric Squires 2020-12-10 17:24:32 -05:00
  • 9f70293700 Remove debug extras from setup.py (#12751) Eric Squires 2020-12-10 17:23:11 -05:00
  • c7239d7b73 [hotfix][autoscaler] Request resources refactor2 (#12661) Ameer Haj Ali 2020-12-09 04:41:30 +02:00
  • ee2cdc0906 oops (#12728) Richard Liaw 2020-12-09 13:38:10 -08:00
  • 1305f5d4e5 [GCS]GCS based Actor Scheduling support actor colocation (#12707) fangfengbin 2020-12-10 03:54:23 +08:00
  • ec81eca6b0 [TEST] Fix Ray windows build for debugger (#12671) Philipp Moritz 2020-12-08 18:12:48 -08:00
  • 16f7abfb8f [dashboard] Resolve npm vulnerabilities (#12620) Sumanth Ratna 2020-12-08 13:26:49 -05:00
  • 986446d15c [Release] release tests yamls for Tune & GPU (#12496) Kai Fricke 2020-12-08 19:15:07 +01:00
  • 38249ae035 [docker] Use legacy resolver (#12741) Ian Rodney 2020-12-10 01:12:46 -08:00