Commit Graph

  • 2708b3abbc [Dashboard][Bug] Fix duplicate node total rows in dashboard (#12410) Max Fitton 2020-11-30 18:43:09 -08:00
  • e422ace053 [serve] Create CurrentState & GoalState (#12369) Ian Rodney 2020-11-30 17:34:30 -08:00
  • 234df9091e [autoscaler] Try to improve the request_resources() documentation (#12465) Eric Liang 2020-11-30 16:03:30 -08:00
  • 9ce7ad17fd [tune] remove some bottlenecks in trialrunner (#12476) Richard Liaw 2020-11-30 14:54:25 -08:00
  • f5fe3794c8 [Docker] Uninstall Typing (#12500) Ian Rodney 2020-11-30 14:12:57 -08:00
  • 3f22448834 Re-Revert "[Core] zero-copy serializer for pytorch (#12344)" (#12478) Siyuan (Ryans) Zhuang 2020-11-30 11:43:03 -08:00
  • bb03e2499b [RLlib] PyBullet Env native support via env str-specifier (if installed). (#12209) Sven Mika 2020-11-30 12:41:24 +01:00
  • b85c6abc3e Rename fields/variables from client id to node id (#12457) Tao Wang 2020-11-30 14:33:36 +08:00
  • 3964defbe1 [Logging] Fix tensorflow logging issue. (#12225) SangBin Cho 2020-11-29 22:16:52 -08:00
  • 91d54ef621 [Core] Remove actor arg from executor to allow users to specify actor… (#12239) SangBin Cho 2020-11-29 22:15:48 -08:00
  • 17a6b9bbe7 Fix not cp jars (#12456) chaokunyang 2020-11-30 13:53:09 +08:00
  • cf73ccddae Allow more fields for object metadata (#12484) Philipp Moritz 2020-11-29 21:50:18 -08:00
  • f1cc33a6a6 Actor resource backlog hotfix (#12471) Alex Wu 2020-11-29 20:55:50 -08:00
  • fb318addcb [RLlib] Curiosity exploration module: tf/tf2.x/tf-eager support. (#11945) Sven Mika 2020-11-29 12:31:24 +01:00
  • a537b852e6 [docs][core] Documentation improvement in master/walkthrough.html (#12473) Micah Yong 2020-11-28 20:36:01 -08:00
  • 8a406e1f9a [SGD] Add PTL Docs (#12440) Amog Kamsetty 2020-11-28 10:09:38 -08:00
  • 60a545ab57 [RLLib] Fix HyperOptSearch tuple to list conversion (#12462) Pierre TASSEL 2020-11-28 19:07:54 +01:00
  • 1d0ade1b93 Revert "[Core] zero-copy serializer for pytorch (#12344)" (#12469) Kai Fricke 2020-11-28 19:06:02 +01:00
  • 9ad0f173d6 Prestart workers to avoid slow start when multi-tenancy is enabled (#12430) Eric Liang 2020-11-27 21:47:46 -08:00
  • 0df55a139c [RLlib] Attention Net prep PR #1: Smaller cleanups. (#12447) Sven Mika 2020-11-28 01:25:47 +01:00
  • 569eee5e71 Enable more new scheduler tests (#12421) Eric Liang 2020-11-27 16:10:38 -08:00
  • affb0b776c Fix github issue template (#12464) Richard Liaw 2020-11-27 14:13:29 -08:00
  • 7c009d22cf [docs] Add xgboost_ray to docs (#12184) Richard Liaw 2020-11-27 11:36:56 -08:00
  • 0a505ca83d [Core] zero-copy serializer for pytorch (#12344) Siyuan (Ryans) Zhuang 2020-11-26 16:09:54 -08:00
  • e0573df337 [CI] Fix windows build (#12415) Amog Kamsetty 2020-11-26 10:52:11 -08:00
  • c1d7826bb7 [RLlib] Move pettingzoo from requirements.txt into requirements_rllib.txt (#12400) Sven Mika 2020-11-26 19:30:35 +01:00
  • 6475297bd3 [RLlib] Torch LR schedule not working. Fix and added test case. (#12396) Sven Mika 2020-11-26 13:14:11 +01:00
  • d5215745e4 [PlacementGroup] Introduce GcsResourceManager and avoid copying resources when scheduling placement groups (#12253) fangfengbin 2020-11-26 11:21:58 +08:00
  • 90d7863eb3 Document that ray install-nightly command doesn't work for 1.0.1.post1 and older (#12429) Edward Oakes 2020-11-25 19:55:12 -06:00
  • 9ccf5f6ccc [ray client] add metadata and secure options to Worker. (#12409) Ameer Haj Ali 2020-11-26 03:48:13 +02:00
  • 751e13a41e [docs] redirect to discourse (#12427) Richard Liaw 2020-11-25 17:10:10 -08:00
  • 323941c745 [tune] fix pbt flakey test (#12418) Richard Liaw 2020-11-25 16:58:37 -08:00
  • f6a5b733d5 Remove flaky object manager test that's no longer needed Eric Liang 2020-11-25 12:45:47 -08:00
  • 679492a235 [serve] Use Long Polling in Backend Worker (#12093) Ian Rodney 2020-11-25 12:11:38 -08:00
  • ca6c2b2442 [docker] Use cuDNN7, not 8 (#12375) Ian Rodney 2020-11-25 12:06:53 -08:00
  • 753cda2f28 [Dashboard] Delete old dashboard (#12144) SangBin Cho 2020-11-25 11:31:02 -08:00
  • b7dbbfbf41 [RLlib] Issue 11591: SAC loss does not use PR-weights in critic loss term. (#12394) Sven Mika 2020-11-25 20:28:46 +01:00
  • 592c161032 [RLlib] Issue 12118: LSTM prev-a/r should be separately configurable. Fix missing prev-a one-hot encoding. (#12397) Sven Mika 2020-11-25 20:27:46 +01:00
  • 841d93d366 [RLlib] Issue 12233 shared tf layers example not really shared (only works for tf1.x, not tf2.x). (#12399) Sven Mika 2020-11-25 20:27:19 +01:00
  • 95175a822f [RLlib] Issue 11974: Traj view API next-action (shift=+1) not working. (#12407) Sven Mika 2020-11-25 20:26:29 +01:00
  • 2e95552f0c [Dashboard] Defensive change to make sure we do not iterate over "None" in the case that workers is not present in node physical stats for a given node (#12358) Max Fitton 2020-11-25 11:06:45 -08:00
  • dc55f6ba3a skip gcs fault tolerance test for the time being when new scheduler is enabled (#12393) ZhuSenlin 2020-11-26 02:40:47 +08:00
  • 2b293832e7 [Docker][minor] Update path in base-deps Dockerfile (#12391) Gekho457 2020-11-25 13:30:10 -05:00
  • 2e4e285ef0 [Object Spilling] Fusion small objects (#12087) SangBin Cho 2020-11-25 10:13:32 -08:00
  • 09d5413f70 [RLlib] rollout batch, handle rewards that are None (unknown) in a multi-agent env (#11858) (#11911) karstenddwx 2020-11-25 13:39:22 +01:00
  • c009c178f6 [RLlib] Closes #11924: Add support for custom/ray environments in rollouts.py for agents without workers (#11926) danuo 2020-11-25 08:43:17 +01:00
  • 82852f0ed2 [RLlib] Add ResetOnExceptionWrapper with tests for unstable 3rd party envs (#12353) Tomasz Wrona 2020-11-25 08:41:58 +01:00
  • c5845c3a4e [docker] Docker stop on each node (#12357) Ian Rodney 2020-11-24 23:15:53 -08:00
  • 20eb217c55 Fix a zsh compatiblity issue in java/BUILD.bazel (#12199) Hao Chen 2020-11-25 14:49:49 +08:00
  • 4066056a0d [ray_client] Support calling functions from other functions and correct the tests (#12141) Barak Michener 2020-11-24 22:19:20 -08:00
  • 4dd0aa7822 [GCS]make thread number of gcs rpc server configurable (#12257) Tao Wang 2020-11-25 11:40:29 +08:00
  • 5d47d02f81 [GCS]add callback for RegisterSelf api, make it done first (#12252) Tao Wang 2020-11-25 11:36:44 +08:00
  • e025b9e788 [TEST]Move all WaitReady together (#12254) Tao Wang 2020-11-25 11:21:24 +08:00
  • 2af10c1b78 [GCS]Add new message ReportResourceUsage (#11848) Tao Wang 2020-11-25 11:18:26 +08:00
  • e1075c0a82 [GCS]Fill resource fields when re-report heartbeat after gcs restarted (#12097) Tao Wang 2020-11-25 11:07:02 +08:00
  • 1d909321c9 [PlacementGroup]Fix node manager release unused bundles bug (#12346) fangfengbin 2020-11-25 11:02:43 +08:00
  • 5934b20b96 [PlacementGroup]Fix destroy bundle resources bug (#12336) fangfengbin 2020-11-25 09:45:26 +08:00
  • 9f322db71d Add many_ppo long running test (#12364) Eric Liang 2020-11-24 16:00:33 -08:00
  • dae137b919 Don't allow 'optional' files in setup.py (#12359) Edward Oakes 2020-11-24 17:41:58 -06:00
  • 4afaa46028 [RLlib] Increase the scope of RLlib's regression tests. (#12200) Sven Mika 2020-11-24 22:18:31 +01:00
  • 5895554555 [autoscaler] Raise node "start" deadline to 900s, make configurable (#12316) Eric Liang 2020-11-24 12:16:01 -08:00
  • 4ada3e4c99 [serve] Incremental change towards async control loop for replica startup (#12281) Edward Oakes 2020-11-24 13:06:08 -06:00
  • 888357d251 added address resolution fix for running in docker containers (#11944) roireshef 2020-11-24 20:34:56 +02:00
  • be0fa7b8b4 Properly specify kubectl-rsync.sh in setup.py (#12356) Edward Oakes 2020-11-24 12:13:29 -06:00
  • e66ddab190 [autoscaler/k8s] Handle unavailable k8s API (#12283) Gekho457 2020-11-24 12:13:15 -05:00
  • 462c7fb575 [streaming] export aligned_ symbols from raylet.so (#12345) Lixin Wei 2020-11-25 00:16:12 +08:00
  • 1ae4d2873a [GCS] refactor gcs initialization (#11890) ZhuSenlin 2020-11-24 21:11:18 +08:00
  • be7938ee09 [PlacementGroup]Fix AddBundleLocations bug (#12330) fangfengbin 2020-11-24 16:57:17 +08:00
  • 2c4514a2c0 [minor] Refactor to expose RedisContext::PingPort (#12022) dHannasch 2020-11-23 21:39:50 -07:00
  • 40cbc638ee A bit more explanation about worker ports (#12185) dHannasch 2020-11-23 21:24:15 -07:00
  • b94bfdfa99 [tune] use default anonymous metric _metric if at least a mode is set (#12159) Kai Fricke 2020-11-24 05:09:33 +01:00
  • 135f2e0602 [serve] Only use typed BackendConfig in controller (#12275) Edward Oakes 2020-11-23 20:56:04 -06:00
  • be2187f942 Remove use_legacy_config tests (#12272) Edward Oakes 2020-11-23 20:30:50 -06:00
  • 1b3c48b922 Remove unused _post_get_hooks (#12270) Edward Oakes 2020-11-23 20:30:30 -06:00
  • a93ed4583d Remove unused print_failed_task (#12267) Edward Oakes 2020-11-23 16:33:10 -06:00
  • e59fe65d3d [tune] Fix logging for dockersyncer (#12196) Richard Liaw 2020-11-23 14:29:41 -08:00
  • 4e43650851 Remove deprecated f._submit API (#12268) Edward Oakes 2020-11-23 16:20:22 -06:00
  • 822d8eb535 Removed double "install" statement (#12261) Tomasz Wrona 2020-11-23 22:36:01 +01:00
  • 6346ea0be6 [tune] Allow None values in TBX logger (#12262) Sumanth Ratna 2020-11-23 15:57:23 -05:00
  • c816a44667 [docs] Insert missing article in sentence (#12245) Max Fan 2020-11-23 15:53:11 -05:00
  • ddd7a18cb9 [docs] Fix dead links to cluster docs by making dashes slashes (#12260) Bart Broere 2020-11-23 21:52:36 +01:00
  • 4c4f189f97 [tune] nevergrad add points_to_evaluate (#12207) viotemp1 2020-11-23 22:51:04 +02:00
  • 1cf95cb081 [Serve] Remove old report-queue-code (#12211) Ian Rodney 2020-11-23 12:42:05 -08:00
  • 7f1f16d99e [Core] Force get_actor(name)'s name to be non-empty string (#12218) Ian Rodney 2020-11-23 12:38:16 -08:00
  • 32d159a2ed Fix release directory & RELEASE_PROCESS.md (#12269) Edward Oakes 2020-11-23 14:28:59 -06:00
  • 40428c9b05 [tune] Make tf distributed testing smaller (#12173) Richard Liaw 2020-11-23 12:15:10 -08:00
  • c99c376d66 Remove unused logging job id field (#12266) Edward Oakes 2020-11-23 13:00:15 -06:00
  • 084f03797b [Placement Group]Placement Group supports gcs failover(Part3) (#12036) fangfengbin 2020-11-23 16:57:58 +08:00
  • 9d133e874c [Java] support java actor class inheritance (#12001) chaokunyang 2020-11-23 14:07:10 +08:00
  • 643c07e451 [tune] update param & install hints for Search Algos (#12242) Kaushik B 2020-11-23 05:09:41 +05:30
  • eb89c567e2 [Metrics] Skip prom file test on Windows. (#12234) SangBin Cho 2020-11-22 00:32:18 -08:00
  • 2a9499a1bd [autoscaler] cover autofilling resources with try except. (#12222) Ameer Haj Ali 2020-11-21 23:04:24 +02:00
  • 608d0378c4 [tune] Add test for infinite trials (#12156) Kai Fricke 2020-11-21 21:54:01 +01:00
  • 839517743d Support ray.* in remote functions for Ray client (#12177) Eric Liang 2020-11-20 13:28:46 -08:00
  • 48042be8bb [tune] Avoid dependency on Kubernetes (#12188) Richard Liaw 2020-11-20 13:01:20 -08:00
  • f6b84cb2f7 [RLlib] Fix offline logp vs prob bug in OffPolicyEstimator class. (#12158) Sven Mika 2020-11-20 08:59:43 +01:00
  • 6f2398d122 Fix lint crash (title underline too short) (#12193) dHannasch 2020-11-19 22:13:23 -07:00
  • d200f620ee Deflake test_router (#12175) Simon Mo 2020-11-19 18:37:46 -08:00
  • 5df9f07ff3 [CI] Use Docker image for microbenchmarks (#12189) Simon Mo 2020-11-19 17:54:40 -08:00
  • 4b2c5daf45 State which IP addresses are failing to match. (#11957) dHannasch 2020-11-19 18:25:25 -07:00