Commit Graph

4884 Commits

Author SHA1 Message Date
Ian Rodney 2e972c2a77 RLLIB and pylintrc (#8995) 2020-06-17 18:14:25 +02:00
Ian Rodney 265ddfc2e4 blacklist to remove (#8994) 2020-06-17 18:02:28 +02:00
fangfengbin c295284370 Optimize gcs server resubscribe (#8896) 2020-06-17 20:05:50 +08:00
Joseph Suarez c6ee3cdff4 Refactor #8792 to integrate latest master (#8956) 2020-06-17 10:55:52 +02:00
Tao Wang 9f0f542660 Remove actor table info from storage when a driver exits (#8761)
* delete contents of table related to specified job when the job is dead

* check status

* implement GetByJobId in gcs table storage

* add test case

* add test case

* fix test cases

* expose MGET and make match_pattern only related with SCAN

* add test case for table storage

* delete checkpoint

* make MGetValues static

* add most test case

* add object test case

* avoid accessing to storage when get matched object ids per job id

* rename job info handler

* use listener to sense job finished

* clear actor state

* add comments, remove actions in task handler

* let raylet do object cleaning. only remove non-detached actors

* only remove informations of non-detached actor

* remove unused methods
2020-06-16 18:43:08 -07:00
Ian Rodney 069b121cc1 [docs] Remove Old warning about IOCTL (#8977) 2020-06-16 18:14:53 -07:00
Stephanie Wang fa16c7666a Fix possible deadlock in CoreWorkerDirectActorTaskSubmitter (#8973) 2020-06-16 15:30:15 -07:00
acmore fa0a677aac Customize service account name. (#8901) 2020-06-16 12:49:41 -05:00
fangfengbin 4facac023f Fix heap-use-after-free bug of gcs pub sub testcase (#8968) 2020-06-16 21:00:37 +08:00
Siyuan (Ryans) Zhuang b68fede30b Convert include guard to pragma once (#8957) 2020-06-16 01:29:43 -07:00
chaokunyang cb6f337372 [Java] Refine python function (#8943) 2020-06-16 16:22:49 +08:00
Sven Mika 14405b90d5 [RLlib] Prototype of a DynaTrainer (for env dynamics learning in upcoming MBMPO algo). (#8860) 2020-06-16 09:01:20 +02:00
Sven Mika 7008902cff [RLlib] Minor rllib.utils cleanup. (#8932) 2020-06-16 08:52:20 +02:00
Sven Mika 0c7764b010 Issue 8919 checkpoint at end ignored (#8933) 2020-06-16 08:51:20 +02:00
Sven Mika bdf1404a5f [RLlib] Issue 8714: QMIX init error w/ tuple obs space. (#8936) 2020-06-16 08:50:53 +02:00
henktillman 508149b3c3 Remove redundant logger warning (#8954) 2020-06-15 21:14:58 -07:00
Simon Mo 1a1ddc74c4 [Serve] Add package reference and links keyword to docstring (#8955) 2020-06-15 18:47:59 -07:00
SangBin Cho 3d1b8c24fd [Dashboard] Dashboard pubsub hotfix. (#8944) 2020-06-15 20:38:56 -05:00
mehrdadn 4afa2b304a Clean up CI ASAN & .bazelrc (#8828) 2020-06-15 17:27:17 -07:00
Stephanie Wang 19d44d4fa9 Use no_restart=False for ray.kill in Serve failure test (#8952) 2020-06-15 15:34:56 -07:00
Max Fitton 4a66b6783a Logical View: Restructuring, tooltips, and QoL changes (#8916) 2020-06-15 16:09:29 -05:00
Max Fitton ddb9368f2c Display GPU Utilization in the Dashboard (#8564) 2020-06-15 15:27:44 -05:00
Richard Liaw 6c49c01837 [tune] Function API checkpointing (#8471)
Co-authored-by: krfricke <krfricke@users.noreply.github.com>
2020-06-15 10:42:54 -07:00
Scott Graham 91e57f2e53 [azure] default workers spot instances + billing profile (#8938) 2020-06-15 10:35:20 -07:00
SangBin Cho 3ca0e6f636 Update incorrect detached actor docs (#8930) 2020-06-15 12:31:02 -05:00
SongGuyang 1583cd14ef Add interfaces for C++ worker cluster mode (#8859) 2020-06-14 19:13:19 -07:00
Jack Carreira 19cc1ae781 [docs] Tune Search: Wrong parameter name (#8927) 2020-06-13 18:01:22 -07:00
Sven Mika 4ed796a7d6 [RLlib] Add testing Policy.compute_single_action() for all agents. (#8903) 2020-06-13 17:51:50 +02:00
mehrdadn 101c215125 Get more tests running on Windows (#6537)
* Get rid of system() calls

* Work around '/usr/share/mini' showing up on GitHub Actions (probably due to psutil truncation)

https://github.com/ray-project/ray/runs/722480047?check_suite_focus=true

* Don't check for socket max path length on Windows

* Don't check for socket existence on Windows

* Fix race condition in Windows fate-sharing

* Work around missing .exe extension for Redis tests

* Add more tests to GitHub Actions

Co-authored-by: Mehrdad <noreply@github.com>
2020-06-12 21:32:10 -07:00
Eric Liang 34bae27ac7 [rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893) 2020-06-12 20:17:27 -07:00
Eli Meirom 5c56760fac [tune] np.array compat for logger (#8918)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-06-12 16:39:01 -07:00
Ian Rodney 0e82f0d7c3 [autoscaler] Create Docker Command Runner (v2) (#8840)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-06-12 16:38:38 -07:00
Siyuan (Ryans) Zhuang ed77c8b16c [Core] Use global variable to eliminate force thread termination in plasma (#8912)
* use global variable to eliminate force thread termination
2020-06-12 14:20:53 -07:00
Richard Liaw 58efec0f2b [sgd] simplify cuda visible device setting (#8775) 2020-06-12 13:53:32 -07:00
mehrdadn 07637e5b5b Upgrade Bazel and add required patches (#8847) 2020-06-12 14:59:22 -05:00
Siyuan (Ryans) Zhuang 4b31b383f3 [Core] Run Plasma Store as a Raylet thread (with a feature flag) (#8897)
* integrate plasma store as a thread (C++)

* integrate plasma store as a thread (Python)

* fix config issues

* remove plasma component fail tests

* without forcefully kill the plasma store thread
2020-06-11 22:54:08 -07:00
chaokunyang dfa4768fc6 [Java] Refactor java api (#8858) 2020-06-12 10:49:01 +08:00
mehrdadn cae475c46a Fix Windows build (#8905)
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-11 14:54:37 -07:00
krfricke 060e524c92 [tune] Parameter columns can now be specified in tune reporters (#8802)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Kai Fricke <kai@anyscale.com>
2020-06-11 11:30:25 -07:00
Richard Liaw e4a1cda0ad [tune][hotfix] fix links (#8904) 2020-06-11 11:29:16 -07:00
Richard Liaw 91d55f52ab [docs] Why Tune? (#8790)
Co-authored-by: Sumanth Ratna <sumanthratna@gmail.com>
2020-06-11 11:23:36 -07:00
Sven Mika 8d1ccfd0f7 [RLlib] Issue 8889: action clipping bug ppo not learning mujoco (#8898) 2020-06-11 19:17:43 +02:00
Sven Mika a90cd0fcbb [RLlib] Unity3d soccer benchmarks (#8834) 2020-06-11 14:29:57 +02:00
internetcoffeephone 9166e22085 Add doc explanation about synchronous algorithm shared GPU utilization between workers and driver. (#8400) 2020-06-11 01:06:04 -07:00
Kristian Holsheimer ea965d7c52 [RLlib] use Mapping instead of dict in summarize() to accommodate non-dict grads/params (e.g. haiku's frozendict) (#8793) 2020-06-11 00:37:15 -07:00
chaokunyang 700d81fa20 [Java] Remove java api sub package from test module (#8853) 2020-06-11 14:59:45 +08:00
Dean Wampler 53712d2ef7 Fix typo in docs for LinearDiscreteEnv (#8891) 2020-06-11 08:34:35 +02:00
Stephanie Wang 05010caed2 [core] Fix race condition for object reconstruction (#8791)
* Fix

* doc

* Unit test

* Update src/ray/core_worker/task_manager.h

Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>

* Update src/ray/core_worker/task_manager.h

Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>

* Update src/ray/core_worker/task_manager.h

Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>

* lint

Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
2020-06-10 19:49:12 -07:00
Edward Oakes 527b0380c9 [serve] Add microbenchmark script (#8887) 2020-06-10 21:28:52 -05:00
Edward Oakes 3a9f45c4b3 [serve] Fix worker batch queue waiting logic (#8884) 2020-06-10 21:28:16 -05:00