617 Commits

Author SHA1 Message Date
Raoul Khouri 44a379ee9b [tune] fixed validation for search metrics (#11583)
* fixed validation for search metrics

* formatting

* made error report better

* if only one metric is missing extract it from list

* any can take a generator
2020-10-30 19:59:20 -07:00
Richard Liaw 395ddb093c [tune] a tiny ptl example (#11497) 2020-10-23 16:14:51 -07:00
Servon aeea168940 [tune] Update for ZOOpt (#11491)
Co-authored-by: Servon <zewen.li@polixir.ai>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-20 23:56:20 -07:00
Kai Fricke 6d11fb8bc6 [tune] validate function callable in tune.with_parameters() (#11504) 2020-10-20 16:03:24 -07:00
Kai Fricke ed81010b0d [tune] support hierarchical search spaces for hyperopt (#11431)
* support hierarchical search spaces for hyperopt

* Reduce num samples

* Fix prefix
2020-10-19 16:00:08 -07:00
Servon 202b1859ef [tune] Update ZOOpt to better support the latest Ray (#11462)
Co-authored-by: Servon <zewen.li@polixir.ai>
2020-10-19 09:28:11 -07:00
Sumanth Ratna 60a4be4a59 [tune] Remove metric and mode kwargs from create_searcher (#11335) 2020-10-14 21:44:36 -07:00
Sumanth Ratna 3fe757391b [tune] Add Basic Variant Generator to search algorithm shim function (#11334)
* Add Basic Variant Generator

* Add 'random' key to SEARCH_ALG_IMPORT

Co-authored-by: Kai Fricke <kai@anyscale.com>
2020-10-14 21:41:47 -07:00
Vishnu Deva 00e0f14c6f [tune] restore trials when sync_on_checkpoint is False (#11355)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-14 19:09:34 -07:00
Kai Fricke f7120d2a18 [tune] Make metrics parameter optional in pytorch lightning integration (#11402) 2020-10-14 17:50:34 -07:00
Amog Kamsetty 933cf6675c [Tune] Changes for Pytorch Lightning 1.0 (#11375) 2020-10-13 15:50:11 -07:00
Richard Liaw 56f858ed1a [tune][docs/util] gputil check, docs (#11260)
Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
2020-10-10 00:54:31 -07:00
Kai Fricke b450cb030a [tune] reuse actors for function API (#11230)
Co-authored-by: Kristian Hartikainen <kristian.hartikainen@gmail.com>
2020-10-08 16:15:02 -07:00
Thomas Tumiel 587319debc [tune] move _SCHEDULERS to tune.schedulers and add all available schedulers (#11218)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-08 16:10:23 -07:00
Richard Liaw 74e9647ec3 [tune] deactivate flaky test for now (#11284) 2020-10-08 15:40:04 -07:00
Amog Kamsetty 1027bfd4b8 [Tune, Ray SGD] Update PTL integrations (#11271) 2020-10-08 13:43:07 -07:00
scottwedge 732cd9901b Fix spelling of occurred (#10792) 2020-10-08 10:55:52 -07:00
Sumanth Ratna 14d8826e43 Fix overriden typo (#11227) 2020-10-07 19:11:07 -07:00
Kai Fricke e58613b5e5 [tune/dashboard] Fix Tune dashboard to work with all trainables (#11232) 2020-10-06 14:03:31 -07:00
Sven Mika ce96b03b07 [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00
Sumanth Ratna 6288d954b5 [tune] Use isinstance instead of type in PBT (#11175) 2020-10-02 18:32:14 -07:00
Richard Liaw 9dc7b7b11d [tune] enable ax test (#11165)
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-01 17:25:29 -07:00
Richard Liaw f60d77d78e [tune] Fix GPU thread leakage 2020-10-01 17:07:13 -07:00
Kai Fricke 681c24754a [tune] Example for using huggingface hyperparameter_search API (#11158) 2020-10-01 16:00:57 -07:00
Kai Fricke bdf647c4ec [tune] docker syncer (#11035)
* Add DockerSyncer

* Add docs

* Update python/ray/tune/integration/docker.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Updated docs

* fix dir

* Added docker integration test

* added docker integration test to bazel build

* Use sdk.rsync API

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-01 11:59:23 -07:00
Richard Liaw b969e8cf59 [tune][hotfix] comment out test (#11140) 2020-10-01 10:04:40 -07:00
Kai Fricke c77cfaa5ad [tune] use dated experiment dir per default (#11104) 2020-09-30 14:43:59 -07:00
SangBin Cho e44828d928 [Test] Make the test size bigger for tune to fix timeout. (#11138) 2020-09-30 11:35:31 -07:00
Kai Fricke 10015e60fb [tune] fix flaky callback test (#11102) 2020-09-29 09:44:32 -07:00
Kai Fricke e7315b0856 [tune] Callbacks for tune runs (#11001) 2020-09-27 16:50:07 -07:00
Eric Liang 609c1b8acd Start moving ray internal files to _private module (#10994) 2020-09-24 22:46:35 -07:00
Amog Kamsetty ee85cb31a5 [Tune] Fix Memory Leak (#10989)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-24 20:26:55 -07:00
Kai Fricke d9c4dea7cf [tune] strict metric checking (#10972) 2020-09-24 10:00:48 -07:00
Kai Fricke 5921e87ecd [tune] Only add new trial when there is no pending trial (#10979) 2020-09-23 11:08:12 -07:00
Kai Fricke 2d16ab2e16 [tune] Remove unnecessary wandb group parameter (#10950) 2020-09-22 09:36:51 -07:00
Sumanth Ratna 770c3633f0 Update max_failures kwarg docstring (#10953) 2020-09-22 09:02:15 -07:00
Kai Fricke 6247740b94 [tune] sort running trials to top in status table (#10926)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-21 13:39:51 -07:00
Kai Fricke 50d63b8077 [tune] update pt tutorial docs (#10925)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-21 13:33:37 -07:00
Eric Liang 6a227ae501 [autoscaler] Split autoscaler interface public private (#10898) 2020-09-18 18:16:23 -07:00
Kai Fricke 508cfa3540 [tune] Support yield and return statements (#10857)
* Support `yield` and `return` statements in Tune trainable functions

* Support anonymous metric with ``tune.report(value)``

* Raise on invalid return/yield value

* Fix end to end reporter test
2020-09-17 20:18:35 -07:00
Kai Fricke ee99c919e3 [tune] lazy trials (#10802)
* Lazily fill trial queue

* Update interface

* Update end to end reporter test

* Removed `next_trials()` method

* Lint

* Print total number of samples to be generated in progress reporter. Allow infinite samples.

* Nit check
2020-09-17 08:51:46 -07:00
Richard Liaw d3feb83053 [tune] check for running session (#10840) 2020-09-16 18:55:11 -07:00
Kai Fricke c9fafe7733 [tune] added type hints (#10806)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-15 21:03:56 -07:00
Amog Kamsetty d5a7c53908 [Ray SGD] use_local flag + Worker group abstraction (#10539)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-15 11:58:57 -07:00
Kai Fricke 0865d68466 [tune] convert fallback representation to numbers in wandb integration (#10799) 2020-09-15 11:47:11 -07:00
Ameer Haj Ali 6edacb22b8 Fix abstraction violations in command_runner interface (#10715)
* Fix abstraction violations in command_runner interface

* user guide

* lint

* breaking abstraction in commands

* extra initialization commands

* more cleanup

* small fixes

* fix test_integration_kubernetes.py

* lint

Co-authored-by: root <root@ip-172-31-28-155.us-west-2.compute.internal>
Co-authored-by: Ameer Haj Ali <ameerhajali@Ameers-MacBook-Pro.local>
2020-09-14 20:28:38 -07:00
Ian Rodney 5bc2ba38fd [docker] Detect CPUs in container correctly (#10507)
Co-authored-by: simon-mo <simon.mo@hey.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Alex Wu <itswu.alex@gmail.com>
2020-09-13 23:40:48 -07:00
Richard Liaw 6662efd74b [tune/rllib] revert removal of queue-trials (#10744) 2020-09-11 14:13:20 -07:00
Kai Fricke 7eaf063f29 [tune] wrapper function to pass arbitrary objects through the object store to trainables (#10679) 2020-09-10 17:39:44 -07:00
Kai Fricke 67bf396ae7 [tune] wandb log cleaning to use yaml representer (#10680)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-10 00:07:16 -07:00