Commit Graph

  • a4b418d30c [docs] update cloud docs (#11262) Richard Liaw 2020-10-21 16:37:26 -07:00
  • e02f4c0157 [New scheduler] queue by shape (#11381) Alex Wu 2020-10-21 15:56:06 -07:00
  • 920e4b2ef8 Try to raise ulimit for file descriptors to max allowed; warn if ulimit is still too low (#11515) Eric Liang 2020-10-21 14:29:43 -07:00
  • e8c77e2847 Remove memory quota enforcement from actors (#11480) Eric Liang 2020-10-21 14:29:03 -07:00
  • 8c82369cad [autoscaler] Add rsync_exclude and rsync_filter options to cluster config (#11512) Alan Guo 2020-10-21 14:28:33 -07:00
  • 9522918fa2 [tune] reenable sklearn (#11192) Richard Liaw 2020-10-21 14:21:38 -07:00
  • 5d7f271e7d Add --worker-port-list option to ray start (#11481) Edward Oakes 2020-10-21 14:46:45 -05:00
  • da2d3fbcfc Remove unused field in heartbeat message (#11459) Tao Wang 2020-10-22 01:49:16 +08:00
  • 078a22d676 [Core] Allow creating tasks/actors in a detached actor when driver has exited (#11493) Kai Yang 2020-10-22 01:45:29 +08:00
  • 7200ddb72d Fix code_search_path failed in java (#11406) Xuxue1 2020-10-21 18:10:48 +08:00
  • aeea168940 [tune] Update for ZOOpt (#11491) Servon 2020-10-21 14:56:20 +08:00
  • a075e37695 [GCS]Fix TestActorTableResubscribe bug (#11463) fangfengbin 2020-10-21 13:32:41 +08:00
  • 2fc3237045 [Dashboard] Update dashboard port checking so that we can instantly reuse the dashboard port (#11487) Allen 2020-10-20 19:19:50 -07:00
  • 6d11fb8bc6 [tune] validate function callable in tune.with_parameters() (#11504) Kai Fricke 2020-10-21 00:03:24 +01:00
  • 2c5cb95b42 [Serve] Get ServeHandle on the same node (#11477) Simon Mo 2020-10-20 10:44:23 -07:00
  • ef96793d3f [Serve] [Doc] Clarify custom method call (#11485) Simon Mo 2020-10-20 10:41:30 -07:00
  • e663b524ae Enable highlighting (#11500) Sumanth Ratna 2020-10-20 12:34:39 -04:00
  • cdca5af53b Revert "[Dashboard] Turn on New Dashboard by Default (#11321)" (#11502) Max Fitton 2020-10-20 11:53:10 -04:00
  • cbc5dac6c4 [Java] fix shutdown lock typo (#11410) Kai Yang 2020-10-20 22:31:27 +08:00
  • aed739fbf4 [Log] Ignore callstacktrace test for windows (#11413) Lingxuan Zuo 2020-10-20 15:21:29 +08:00
  • 15cbfae8ad fix cross language ut (#11492) DK.Pino 2020-10-20 14:14:40 +08:00
  • 1b3b009f7a [PlacementGroup]Add guarded by in placement group scheduler ut (#11306) DK.Pino 2020-10-20 09:54:35 +08:00
  • 0a9cc9cce5 Revert "remove .fake build files (#11478)" (#11488) Max Fitton 2020-10-19 21:48:32 -04:00
  • 1bce7518f3 [Serve][Doc] Add FAQ Page (#11258) Simon Mo 2020-10-19 17:15:22 -07:00
  • 09932ee72b [docs] correct docker image name in Launch Ray in Docker section (#11475) Lee moon soo 2020-10-19 16:01:03 -07:00
  • ed81010b0d [tune] support hierarchical search spaces for hyperopt (#11431) Kai Fricke 2020-10-20 00:00:08 +01:00
  • 3ed3dea004 remove .fake build files (#11478) Max Fitton 2020-10-19 18:36:47 -04:00
  • 2f74fe5b71 [tune/docs] Add PTL example to tune docs/examples (#11474) Kai Fricke 2020-10-19 22:47:58 +01:00
  • 79fd1cfe6f Update dashboard build instructions for new dashboard (#11471) Max Fitton 2020-10-19 16:12:22 -04:00
  • 4946c1e6de [docker] Minor improvements (#11352) Sumanth Ratna 2020-10-19 15:42:53 -04:00
  • acbd12eabf [Docker] Set Docker as the Default (#11416) Ian Rodney 2020-10-19 10:53:30 -07:00
  • f500292d41 [Dashboard] Turn on New Dashboard by Default (#11321) Max Fitton 2020-10-19 13:31:11 -04:00
  • 202b1859ef [tune] Update ZOOpt to better support the latest Ray (#11462) Servon 2020-10-20 00:28:11 +08:00
  • 798bd6a359 [dashboard] Add /api/cluster_status endpoint (#11456) Edward Oakes 2020-10-19 11:00:47 -05:00
  • da89cb19eb [GCS]Fix node info idempotent bug (#11423) fangfengbin 2020-10-19 10:23:33 +08:00
  • 8581dd2fb1 [Autoscaler] Staroid node provider followup improvements (#11408) Lee moon soo 2020-10-18 00:26:11 -07:00
  • 48b75a6922 [docker] latest -> nightly on docker image tag on build-docker.sh script (#11403) Lee moon soo 2020-10-18 00:24:28 -07:00
  • f37e967ada [Docs] READmes for DockerHub (#11350) Ian Rodney 2020-10-16 15:46:13 -07:00
  • 6d6a536601 [Docker] Fix Docker Stop on Teardown (#11400) Ian Rodney 2020-10-16 15:45:33 -07:00
  • 6233cef22c [Placement Group] Remove useless placement group argument in ray remote (#11389) DK.Pino 2020-10-17 05:17:18 +08:00
  • 50be2970dc [autoscaler]Type hints for commands.py and sdk.py. (#11354) Gekho457 2020-10-16 16:45:36 -04:00
  • 2aec77e305 [RLlib] Fix two test cases that only fail on Travis. (#11435) Sven Mika 2020-10-16 20:53:30 +02:00
  • f890808c14 [docs] Add xgboost_ray to community integrations (#11428) Kai Fricke 2020-10-16 18:15:28 +01:00
  • 7d1015d23e [Doc] Fix Core Link (#11438) Richard Liaw 2020-10-16 10:10:56 -07:00
  • 2f3ec4ef75 add streaming data writer unit tests (#11387) Lingxuan Zuo 2020-10-16 14:42:44 +08:00
  • 049985549b Restore uptime timeout to 5 seconds (#11300) mattearllongshot 2020-10-16 01:21:11 +01:00
  • 436202bcfd [docs] Fix typos in documentation (#11414) herve-alanaai 2020-10-16 01:00:48 +01:00
  • afd797b896 [docker] Check for GPUs before setting runtime-nvidia (#11418) Ian Rodney 2020-10-15 15:43:09 -07:00
  • 38eb61442b [SGD] Callback API for SGD+Tune (#11316) Amog Kamsetty 2020-10-15 15:22:14 -07:00
  • 414041c6dd [RLlib] Do not create env on driver iff num_workers > 0. (#11307) Sven Mika 2020-10-15 18:21:30 +02:00
  • 60a4be4a59 [tune] Remove metric and mode kwargs from create_searcher (#11335) Sumanth Ratna 2020-10-15 00:44:36 -04:00
  • 3fe757391b [tune] Add Basic Variant Generator to search algorithm shim function (#11334) Sumanth Ratna 2020-10-15 00:41:47 -04:00
  • 00e0f14c6f [tune] restore trials when sync_on_checkpoint is False (#11355) Vishnu Deva 2020-10-15 07:39:34 +05:30
  • f7120d2a18 [tune] Make metrics parameter optional in pytorch lightning integration (#11402) Kai Fricke 2020-10-15 01:50:34 +01:00
  • 34191107a3 [cpp worker] fix crash in empty args task (#11363) SongGuyang 2020-10-15 07:48:34 +08:00
  • 666fcde8ca [Placement group] Input validation (#11152) SangBin Cho 2020-10-14 13:56:41 -07:00
  • a10e36ca04 Make the logging of gc.collect() freed refs appear in DEBUG not INFO (#11353) Ameer Haj Ali 2020-10-14 13:14:35 -07:00
  • 7466ce82df [Autoscaler] Placement group autoscaling (#11243) Alex Wu 2020-10-14 13:11:46 -07:00
  • aefcf901d3 [docs] Add sklearn integration link Eric Liang 2020-10-14 13:07:23 -07:00
  • b1481c6acf Revert "[PlacementGroup]Add node manager test framework (#11174)" (#11398) SangBin Cho 2020-10-14 11:09:20 -07:00
  • 149ec5f6bf [Log] dump stacktrace from glog lib (#11360) Lingxuan Zuo 2020-10-15 01:52:12 +08:00
  • abc6126814 [Java] Release actor instance reference when Ray.exitActor() is invoked (#11324) Kai Yang 2020-10-14 13:12:59 +08:00
  • c926838411 [GCS]Fix GcsActorManagerTest multithreading bug (#11361) fangfengbin 2020-10-14 12:36:40 +08:00
  • 5637093f44 Add Serve load testing tool to long running test yaml (#11386) Simon Mo 2020-10-13 20:24:57 -07:00
  • 866193b01c Fix cluster yaml for serve benchmarks (#11383) Simon Mo 2020-10-13 19:30:18 -07:00
  • 241e765d3a [PlacementGroup]Add node manager test framework (#11174) fangfengbin 2020-10-14 10:27:11 +08:00
  • cd9dcfca0d [Dashboard] CPU/GPU usage details in actor pane (#11269) Max Fitton 2020-10-13 21:23:23 -04:00
  • 933cf6675c [Tune] Changes for Pytorch Lightning 1.0 (#11375) Amog Kamsetty 2020-10-13 15:50:11 -07:00
  • a6a94d3206 [RLlib] Fix test_env_with_subprocess.py. (#11356) Sven Mika 2020-10-13 21:42:20 +02:00
  • 63fa0a53a3 [k8s] Fix kubernetes cloud cluster example configuration (#11364) J Seppänen 2020-10-13 22:28:55 +03:00
  • 84617f6ff6 [docker] Script for quickly fixing all Latest images (#11351) Ian Rodney 2020-10-13 09:36:40 -07:00
  • 39e809fa03 Update microbenchmark script to use Python 3.8 wheel (#11357) Simon Mo 2020-10-13 09:27:52 -07:00
  • 0c02427da2 [GCS]Eviction of destroyed actors cached in GCS (#11338) fangfengbin 2020-10-13 15:34:35 +08:00
  • c84a9b457c [Streaming] add barrier helper tests (#11107) Lingxuan Zuo 2020-10-13 09:55:55 +08:00
  • 6426fb3fff [CI] Fix-Up Docker Build (Use Python) (#11139) Ian Rodney 2020-10-12 14:22:51 -07:00
  • 1ebcdf236f [RLlib] Add support for custom MultiActionDistributions. (#11311) Sven Mika 2020-10-12 22:50:43 +02:00
  • 0c0f67c14d [RLlib] ARS/ES eval workers not working: Issue 9933. (#11308) Sven Mika 2020-10-12 22:49:48 +02:00
  • 8ea1bc5ff9 [RLlib] Allow for more than 2^31 policy timesteps. (#11301) Sven Mika 2020-10-12 22:49:11 +02:00
  • f5e2cda68a [RLlib] SAC: log_alpha not being learnt when on GPU. (#11298) Sven Mika 2020-10-12 22:48:44 +02:00
  • 7dcfd258cd [RLlib] Assert LongTensor in SAC Discrete PyTorch (#11245) Julius Frost 2020-10-12 16:47:21 -04:00
  • 580820a530 [RLlib] Create ci/rllib_tests and organize a little (#11342) Sven Mika 2020-10-12 21:05:09 +02:00
  • c107eea551 [Core] Do not report stats when worker is already dead. (#11167) SangBin Cho 2020-10-12 11:57:04 -07:00
  • 56f69543d0 Try to deflake test_failure (#11293) SangBin Cho 2020-10-12 10:03:36 -07:00
  • 06fe690682 [autoscaler] Limit max launch concurrency per node type (#11242) Ameer Haj Ali 2020-10-12 09:45:52 -07:00
  • 92a58aabce [tune][docs] Fix learning rate bounds in FAQ (#11345) Sumanth Ratna 2020-10-12 12:44:53 -04:00
  • 175fc41fbc [Autoscaler] Account for resource backlog size (#11261) Alex Wu 2020-10-12 09:43:48 -07:00
  • d3bc20b727 [RLlib] ConvTranspose2D module (#11231) Sven Mika 2020-10-12 15:00:42 +02:00
  • d1579819e9 [GCS]Eviction of dead nodes cached in GCS (#11323) fangfengbin 2020-10-12 15:54:32 +08:00
  • 31117b5e96 [GCS]Add job id to log (#11331) fangfengbin 2020-10-12 13:53:08 +08:00
  • 0d09a17c64 Skip set_result if the future is done (#11256) Simon Mo 2020-10-11 22:33:58 -07:00
  • f9a29a6d26 [docs] Fix pip install commands (#11326) Alex V. Kotlar 2020-10-12 01:12:18 -04:00
  • 957877ad3f Tf version of VisionNet (ray/rllib/model/tf/vision_net.py) crashes iff len(conv-filters)=1. (#11330) Sven Mika 2020-10-11 12:49:47 +02:00
  • 56f858ed1a [tune][docs/util] gputil check, docs (#11260) Richard Liaw 2020-10-10 00:54:31 -07:00
  • defd41aad7 [Dashboard] http route handler cache (#10921) fyrestone 2020-10-10 13:27:05 +08:00
  • 9dd4561d1b [Placement Group] Fix stress tests to pass when actors are scheduled. (#11151) SangBin Cho 2020-10-09 21:52:26 -07:00
  • 0737e78445 [Java] upgrade common-collections version (#10613) chaokunyang 2020-10-10 11:16:12 +08:00
  • 48db6f8858 [autoscaler/k8s] namespace permissions problem (#11270) Gekho457 2020-10-09 20:22:20 -04:00
  • 92b4059cad Replace read_namespaced_pod_status with read_namespaced_pod (#11278) Gekho457 2020-10-09 20:21:39 -04:00
  • 5ef1784024 [Autoscaler] Fix sdk (#11314) Ian Rodney 2020-10-09 12:34:29 -07:00
  • 3eb2b9e216 [GCS]Random eviction of destroyed actors cached in GCS (#11189) fangfengbin 2020-10-10 02:54:47 +08:00