Commit Graph

14 Commits

Author SHA1 Message Date
Simon Mo f596113fc7 [Core] Actor Retries Out of Order Tasks on Restart (#12338) 2020-12-01 09:35:54 -08:00
Eric Liang 2e5e9b291b [ray 1.0] Make private a few more things (#10551)
* reomve some more stuff

* no underscore
2020-09-04 17:19:27 -07:00
Eric Liang 2a204260a8 [api] Second round of 1.0 API changes: exceptions, num_return_vals (#10377) 2020-08-28 19:57:02 -07:00
Simon Mo 6b93ad11d0 [Doc] Add Architecture Doc for Ray Serve (#10204) 2020-08-20 11:40:47 -07:00
Stephanie Wang f2705e2c73 [core] Enable object reconstruction for retryable actor tasks (#9557)
* Test actor plasma reconstruction

* Allow resubmission of actor tasks

* doc

* Test for actor constructor

* Kill PID before removing node

* Kill pid before node
2020-07-23 21:15:12 -07:00
Stephanie Wang b351d13940 [core] Add flag to enable object reconstruction during ray start (#9488)
* Add flag

* doc

* Fix tests
2020-07-17 10:13:14 -07:00
Stephanie Wang bd169749e0 Option to retry failed actor tasks (#8330)
* Python

* Consolidate state in the direct actor transport, set the caller starts at

* todo

* Remove unused

* Update and unit tests

* Doc

* Remove unused

* doc

* Remove debug

* Update src/ray/core_worker/transport/direct_actor_transport.h

Co-authored-by: Eric Liang <ekhliang@gmail.com>

* Update src/ray/core_worker/transport/direct_actor_transport.cc

Co-authored-by: Eric Liang <ekhliang@gmail.com>

* lint and fix build

* Update

* Fix build

* Fix tests

* Unit test for max_task_retries=0

* Fix java?

* Fix bad test

* Cross language fix

* fix java

Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-05-15 20:15:15 -07:00
Max Fitton 00325eb2b2 Rename max_reconstructions to max_restarts and use -1 for infinite (#8274)
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
2020-05-14 10:30:29 -05:00
Robert Nishihara 5e43b25e8c Document fault tolerance behavior. (#6698) 2020-01-06 22:34:06 -08:00
Richard Liaw 411f30c125 [docs] Second push of changes (#5391) 2019-08-28 17:54:15 -07:00
Robert Nishihara 57d6e98302 Update actor fault tolerance documentation. (#3175) 2018-11-01 11:52:05 -07:00
Robert Nishihara 658c14282c Remove legacy Ray code. (#3121)
* Remove legacy Ray code.

* Fix cmake and simplify monitor.

* Fix linting

* Updates

* Fix

* Implement some methods.

* Remove more plasma manager references.

* Fix

* Linting

* Fix

* Fix

* Make sure class IDs are strings.

* Some path fixes

* Fix

* Path fixes and update arrow

* Fixes.

* linting

* Fixes

* Java fixes

* Some java fixes

* TaskLanguage -> Language

* Minor

* Fix python test and remove unused method signature.

* Fix java tests

* Fix jenkins tests

* Remove commented out code.
2018-10-26 13:36:58 -07:00
Stephanie Wang fb4d991732 Add actor reconstruction limitations to documentation (#1452)
* Add limitations to actor fault tolerance

* rename heading
2018-01-23 13:40:50 -08:00
Robert Nishihara af71f9616e Initial fault tolerance documentation. (#845)
* Initial fault tolerance documentation.

* Update documentation.
2017-08-19 17:38:14 -07:00