SangBin Cho
f6f3cc9af1
[Core]Remove checkpoint table ( #12235 )
...
* Delete an actor entry from node manager.
* Remove checkpoint table
* remote checkpoint interface
* remove checkpoint interface
* fix ExitActorTest
Co-authored-by: chaokunyang <shawn.ck.yang@gmail.com >
2020-12-01 08:58:36 -08:00
Tao Wang
b85c6abc3e
Rename fields/variables from client id to node id ( #12457 )
2020-11-30 14:33:36 +08:00
Eric Liang
380df89069
Lazily initialize the global state accessor in Python workers ( #12054 )
...
* wip
* fix
* fix
2020-11-16 21:35:12 -08:00
Kishan Sagathiya
c5e6c90e1e
[Core] Add name of actor in the result of ray.actors() ( #11828 )
...
Added name field to `actor_info`
Fixes #11112
2020-11-06 10:45:44 -08:00
DK.Pino
b10871a1f5
[Core]Fix get workrer table bug ( #11516 )
...
* fix get_worker_table bug
* fix lint
* fix comment
* remove actor table
* fix comment
* fix get alive worker
* remove unused python import
2020-10-30 14:48:29 -07:00
DK.Pino
9f804ade5f
[Placement Group]Add get all placement group api ( #11460 )
...
* add get all interface for placement group
* add get all interface for placement group
* make it work
* fix lint
* fix lint
* fix comment
* add cpp test
* fix python lint
2020-10-23 11:46:48 -07:00
Tao Wang
1db83764bf
[GCS]Use new getting all available resources interface instead of pub-sub … ( #10914 )
...
* Use new all available resources getting interface instead of pub-sub in state.py
* add missing server handler and test cases, fix comments
* add fine grained test assert
* per comments
* involve new added function _available_resources_per_node
* change ClientID to NodeID
* fix compile
* fix client id and lint
* robust tests check
* robust tests
2020-09-29 09:41:10 -07:00
DK.Pino
db7097fb1f
[Refactor] Rename ClientId to NodeId ( #10992 )
...
* rename ClientId to NodeId
* format lint
* format lint
* fix conflicts
* rename new ClientId to NodeId
* update lint
* make same version of clang-format with travis ci
2020-09-27 10:24:21 -07:00
Eric Liang
609c1b8acd
Start moving ray internal files to _private module ( #10994 )
2020-09-24 22:46:35 -07:00
Simon Mo
4f6e218a3d
Add a new _available_resources_per_node for state API ( #11014 )
2020-09-24 17:25:15 -07:00
SangBin Cho
5e6b887f2d
[Placement Group] Capture Child Task Part 1 ( #10968 )
...
* In progress.
* In progers.
* Done.
* Addressed code review.
* Increase timeout to make a test less flaky.
* Addressed code review.
* Addressed code review.
2020-09-24 09:02:03 -07:00
SangBin Cho
1fdb7ef6c3
[docs] Placement group documentation ( #10555 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu >
2020-09-16 16:07:55 -07:00
Eric Liang
2a204260a8
[api] Second round of 1.0 API changes: exceptions, num_return_vals ( #10377 )
2020-08-28 19:57:02 -07:00
Eric Liang
519354a39a
[api] Initial API deprecations for Ray 1.0 ( #10325 )
2020-08-28 15:03:50 -07:00
Lixin Wei
fe6daef85e
[Core]Add runtime context for python worker ( #10309 )
...
* add runtime context for python
* fixed
* code fixed
* test added
* lint
* lint
2020-08-26 20:11:42 -07:00
Robert Nishihara
79eefbf357
Better checking that ray.init() has been called. ( #10261 )
2020-08-25 17:13:11 -07:00
SangBin Cho
1f54acd274
[Tech Debt] Use f-string for python/ray/*.py ( #10268 )
...
* In progress.
* Done with critical path.
* Modified cluster_utils.py and log_monitor.py
* Addressed code review.
2020-08-23 22:01:31 -07:00
SangBin Cho
92664249e8
Partially Use f string ( #10218 )
...
* flynt. trial 1.
* Trial 1.
* Addressed code review.
2020-08-20 18:21:16 -07:00
SangBin Cho
224933b5e4
[Placement Group] Remove API part 2 ( #10215 )
...
* Initial progress done.
* Fix mistake.
* Addressed code review.
* Fix cpp build issue.
* Addressed code review.
2020-08-20 09:50:13 -07:00
SangBin Cho
263df6163c
[Placement Group] Placement group remove api part 1 ( #10063 )
...
* Added basic rpc calls.
* fix issues.
* Fix the gcs server not getting request issue.
* In Progress.
* Basic logic done. Tests are required.
* In progress.
* In progress in refactoring context.
* Revert "In progress in refactoring context."
This reverts commit 38236256cf1306c60dd203e75d45ceb4509c8106.
* Working now.
* Python test works.
* Lint.
* Addressed code review.
* Addressed code review.
* Lint.
* Added unit tests.
* Done, but one of unit tests fail
* Addressed code review.
* Addressed the last code review.
* Fix the wrong test case.
2020-08-18 12:44:00 -07:00
SangBin Cho
053188dfbe
[Placement Group] Support Placement Group state table. ( #10090 )
...
* Done.
* Addressed code review.
* Linting.
* Fix lint.
* Fix lint.
* Fix a test.
* Lint.
* Add a lint sleep to test.
* Fix the lint issue.
* Fixed doc build error.
2020-08-17 09:24:50 -07:00
SangBin Cho
ec2f1a225e
[Stats] Metrics Export User Interface Part 1 ( #9913 )
...
* Metrics export port expose done.
* Support exposing metrics port + metrics agent service discovery through ray.nodes()
* Formatting.
* Added a doc.
* Linting.
* Change the location of metrics agent port.
* Addressed code review.
* Addressed code review.
2020-08-06 16:16:29 -07:00
kisuke95
28b1f7710c
[Core] Error info pubsub (Remove ray.errors API) ( #9665 )
2020-08-04 14:04:29 +08:00
Alisa
51e12ee97c
Python api of placement group ( #9243 )
2020-07-27 14:57:05 -07:00
SangBin Cho
22b2e51152
Fix test-multi-node ( #9453 )
2020-07-13 20:44:27 -05:00
Hao Chen
d49dadf891
Change Python's ObjectID to ObjectRef ( #9353 )
2020-07-10 17:49:04 +08:00
ChenZhilei
6f3d993681
GCS server use worker table to handle RegisterWorker instead of redis accessor ( #9168 )
2020-07-06 10:37:25 +08:00
Tao Wang
f7ac495a68
[Core] Use map instead of list to represent resources in heartbeat message ( #9294 )
2020-07-05 10:59:25 +08:00
SangBin Cho
7c43991100
[GCS] Monitor.py bug fix ( #8725 )
...
* comment.
* Fix bugs.
* Used pubsub message instead.
* Added a ray.actors test
2020-06-02 16:06:36 -07:00
Lingxuan Zuo
4cbbc15ca7
[GCS] Global state accessor from node resource table ( #8658 )
2020-06-02 14:01:00 +08:00
fangfengbin
016337d4eb
Heartbeat table uses gcs pub-sub instead of redis accessor ( #8655 )
2020-05-30 23:17:25 +08:00
fangfengbin
35eeec5647
Add C++ global state for actor table ( #8501 )
...
* add global state actors
* fix code style
* fix GcsActorManagerTest bug
* rebase master
* add jni code
* add get checkpoint id code
* add debug code
* add debug code
* change log level
* fix compile bug
* return null in jni
* fix crash bug
* change import seq
Co-authored-by: 灵洵 <fengbin.ffb@antfin.com >
Co-authored-by: Hao Chen <chenh1024@gmail.com >
2020-05-29 21:10:42 +08:00
Lingxuan Zuo
e594524ed3
[GCS] global state query node info table from GCS. ( #8498 )
2020-05-28 16:39:13 +08:00
Ujval Misra
e958d261b6
Fix ray.available_resources bug ( #8537 )
2020-05-27 17:55:08 -07:00
fangfengbin
765d470c40
Add gcs object manager ( #8298 )
2020-05-25 17:21:35 +08:00
Tao Wang
92c2e41dfd
[GCS]profile info getting implementation based gcs service ( #8536 )
2020-05-24 22:23:01 +08:00
fangfengbin
9347a5d10c
Add global state accessor of jobs ( #8401 )
2020-05-18 20:32:05 +08:00
Edward Oakes
16f48078d9
Remove use of ObjectID transport flag ( #7699 )
2020-05-17 11:29:49 -05:00
Edward Oakes
793e616a2d
Fix job table parsing ( #8070 )
2020-04-20 12:56:43 -05:00
Robert Nishihara
b011c604d7
Remove ray.tasks() from API. ( #7807 )
2020-04-01 10:10:40 -05:00
Edward Oakes
9318b29f5e
Remove is_direct logic from the raylet ( #7698 )
2020-03-23 17:09:35 -05:00
Simon Mo
31d63d3ca7
Fix global state actors() call ( #7567 )
2020-03-11 16:59:50 -07:00
Edward Oakes
7b609ca211
Remove instances of 'raise Exception' ( #7523 )
2020-03-10 17:51:22 -07:00
fyrestone
0648bd28ef
[xlang] Cross language Python support ( #6709 )
2020-02-08 13:01:28 +08:00
Yunzhi Zhang
3acf3c7675
[Dashboard] Add actor task counter ( #6820 )
2020-01-17 15:43:56 -08:00
Sven
60d4d5e1aa
Remove future imports ( #6724 )
...
* Remove all __future__ imports from RLlib.
* Remove (object) again from tf_run_builder.py::TFRunBuilder.
* Fix 2xLINT warnings.
* Fix broken appo_policy import (must be appo_tf_policy)
* Remove future imports from all other ray files (not just RLlib).
* Remove future imports from all other ray files (not just RLlib).
* Remove future import blocks that contain `unicode_literals` as well.
Revert appo_tf_policy.py to appo_policy.py (belongs to another PR).
* Add two empty lines before Schedule class.
* Put back __future__ imports into determine_tests_to_run.py. Fails otherwise on a py2/print related error.
2020-01-09 00:15:48 -08:00
Robert Nishihara
39a3459886
Remove (object) from class declarations. ( #6658 )
2020-01-02 17:42:13 -08:00
Yunzhi Zhang
8a0a30b5f0
[Dashboard] display actor status and infeasible tasks ( #6652 )
...
* expose actor status and protobuf message of infeasible tasks
* move infeasible tasks into actor tree
* add pytest for displaying infeasible tasks info
* fix base64 decoding
* fix race condition after #6629 merged
2020-01-02 14:27:59 -08:00
Robert Nishihara
9baa002069
Remove deprecated global state. ( #6655 )
2019-12-31 22:40:47 -08:00
Philipp Moritz
ecddaafd94
Add actor table to global state API ( #6629 )
2019-12-31 15:11:59 -08:00