Commit Graph

117 Commits

Author SHA1 Message Date
Richard Liaw 6c77aeb98a [docs] ray slack remove banners (#13898)
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-02-04 01:14:34 -08:00
Kai Fricke d29fcfb45c [tune] catch SIGINT signal and trigger experiment checkpoint (#13767)
* [tune] catch SIGINT signal and trigger experiment checkpoint

* Apply suggestions from code review

* Fix user guide docs

* Update doc/source/tune/user-guide.rst
2021-02-02 14:52:09 +01:00
architkulkarni 28cf5f91e3 [docs] change MLFlow to MLflow in docs (#13739) 2021-01-27 16:53:15 -08:00
Amog Kamsetty 20016c983f [Tune] MLflow Credentials (#13533) 2021-01-19 11:55:13 -08:00
Kai Fricke dc42abb2f5 [tune] placement group support (#13370) 2021-01-18 11:58:57 -08:00
Richard Liaw 86387504ee [tune] fix small docs typo (#13355)
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-01-16 00:49:17 -08:00
Kai Fricke 518427627b [tune] buffer trainable results (#13236)
* Working prototype

* Pass buffer length, fix tests

* Don't buffer per default

* Dispatch and process save in one go, added tests

* Fix tests

* Pass adaptive seconds to train_buffered, stop result processing after STOP decision

* Fix tests, add release test

* Update tests

* Added detailed logs for slow operations

* Update python/ray/tune/trial_runner.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Apply suggestions from code review

* Revert tests and go back to old tuning loop

* nit

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-01-12 18:52:47 +01:00
Edwin Goh a5ddc27bab Fix typo in Tune Docs (Checkpointing) (#13348)
See issue #13299
2021-01-11 20:27:18 -08:00
Amog Kamsetty 0452a3a435 [Tune] Rename MLFlow to MLflow (#13301) 2021-01-11 17:36:55 -08:00
Kai Fricke 97211a6170 [Tune] Fix tune serve integration example (#13233) 2021-01-06 17:02:04 +01:00
Lavanya Shukla 350917958c [docs] fix wandb url (#13094) 2020-12-28 17:19:17 -08:00
Antoni Baum a4f2dd2138 [Tune]Add integer loguniform support (#12994)
* Add integer quantization and loguniform support

* Fix hyperopt qloguniform not being np.log'd first

* Add tests, __init__

* Try to fix tests, better exceptions

* Tweak docstrings

* Type checks in SearchSpaceTest

* Update docs

* Lint, tests

* Update doc/source/tune/api_docs/search_space.rst

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2020-12-23 09:27:16 -08:00
Amog Kamsetty 5d3c9c8861 [Tune] Mlflow Integration (#12840)
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-19 00:40:02 -08:00
Kai Fricke 3d72000826 [tune] Add points_to_evaluate to BasicVariantGenerator (#12916)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-17 19:16:03 -08:00
Kai Fricke 5f04ade6ef [tune] add more stoppers and stopper documentation (#12750)
* Add new stoppers & docs

* Add tests for maximum iteration stopper and trial plateau stopper

* Update python/ray/tune/stopper.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Update doc/source/tune/api_docs/stoppers.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Update doc/source/tune/api_docs/stoppers.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Apply suggestions from code review

* Apply suggestions from code review

* Update python/ray/tune/stopper.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-12 01:47:19 -08:00
Richard Liaw 9ce7ad17fd [tune] remove some bottlenecks in trialrunner (#12476) 2020-11-30 14:54:25 -08:00
Richard Liaw 7c009d22cf [docs] Add xgboost_ray to docs (#12184)
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2020-11-27 11:36:56 -08:00
Richard Liaw e59fe65d3d [tune] Fix logging for dockersyncer (#12196) 2020-11-23 14:29:41 -08:00
Kai Fricke 9f5986ee58 [tune] logger migration to ExperimentLogger classes (#11984) 2020-11-16 15:08:37 -08:00
Richard Liaw 8b3f79f307 [tune] refactor and add examples (#11931) 2020-11-14 20:43:28 -08:00
Keqiu Hu 0c1bdaef59 [tune] TensorFlow Distributed Trainable (#11876)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-10 14:59:08 -08:00
Kai Fricke 603accf1c2 [tune] logger refactor part 3: Add ExperimentLogger class (#11749) 2020-11-05 08:55:38 -08:00
Richard Liaw efa07d5403 Revert "Revert "[tune] PB2 (#11466)" (#11795)" (#11812) 2020-11-04 20:47:12 -08:00
Amog Kamsetty 7248d5f4ae Revert "[tune] PB2 (#11466)" (#11795)
This reverts commit e7aafd7d24.
2020-11-03 21:05:00 -08:00
Kai Fricke f7b19c41e3 [tune] logger refactor part 1: move classes and utilities to own files (#11746)
* [tune] logger refactor part 1: move classes and utilities to own files

* Fix circular dependency

* Remove uneeded pretty print copy

* Apply suggestions from code review
2020-11-03 07:48:09 -08:00
Jack Parker-Holder e7aafd7d24 [tune] PB2 (#11466)
Co-authored-by: Sumanth Ratna <sumanthratna@gmail.com>
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-27 01:03:21 -07:00
Richard Liaw b02e61f672 [minor] fix up docs (#11596)
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-26 12:19:03 -07:00
Richard Liaw 1b357533b1 [tune] Try to enable PTL, SKlearn tests (#11542) 2020-10-24 01:08:46 -07:00
Richard Liaw e7aa6441b7 [tune] a tiny ptl example (#11497) 2020-10-22 18:50:34 -07:00
Frank Gu 73fa94731f [tune] Add HDFS as Cloud Sync Client (#11524) 2020-10-22 14:12:51 -07:00
Richard Liaw a4b418d30c [docs] update cloud docs (#11262)
* update-cloud-docs

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* Update doc/source/cluster/config.rst

Co-authored-by: Ian Rodney <ian.rodney@gmail.com>

* fix

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* fix

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

Co-authored-by: Ian Rodney <ian.rodney@gmail.com>
2020-10-21 16:37:26 -07:00
Kai Fricke 2f74fe5b71 [tune/docs] Add PTL example to tune docs/examples (#11474) 2020-10-19 14:47:58 -07:00
Sumanth Ratna 92a58aabce [tune][docs] Fix learning rate bounds in FAQ (#11345) 2020-10-12 09:44:53 -07:00
Richard Liaw 56f858ed1a [tune][docs/util] gputil check, docs (#11260)
Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
2020-10-10 00:54:31 -07:00
Kai Fricke b450cb030a [tune] reuse actors for function API (#11230)
Co-authored-by: Kristian Hartikainen <kristian.hartikainen@gmail.com>
2020-10-08 16:15:02 -07:00
Sumanth Ratna 14d8826e43 Fix overriden typo (#11227) 2020-10-07 19:11:07 -07:00
Amog Kamsetty 3b76def2d2 [Docs] [Tune] Add NeuroCard to open source projects using Tune (#11213) 2020-10-06 14:22:32 -07:00
Kai Fricke 681c24754a [tune] Example for using huggingface hyperparamer_search API (#11158) 2020-10-01 16:00:57 -07:00
Kai Fricke bdf647c4ec [tune] docker syncer (#11035)
* Add DockerSyncer

* Add docs

* Update python/ray/tune/integration/docker.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Updated docs

* fix dir

* Added docker integration test

* added docker integration test to bazel build

* Use sdk.rsync API

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-01 11:59:23 -07:00
Sumanth Ratna 98ebf8e2d8 [tune][docs] fix typo in Tune FAQ (#11161)
* Fix typo in tune FAQ (used to use)

* Update doc/source/tune/_tutorials/_faq.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-01 11:20:41 -07:00
Kai Fricke c77cfaa5ad [tune] use dated experiment dir per default (#11104) 2020-09-30 14:43:59 -07:00
Kai Fricke b8f344f695 [tune] add faq entry for reproducing experiments (setting seeds etc) (#11106) 2020-09-29 14:48:39 -07:00
Kai Fricke e7315b0856 [tune] Callbacks for tune runs (#11001) 2020-09-27 16:50:07 -07:00
Richard Liaw a563344bc2 [docs] remove ref to google groups -> github discussions (#11019) 2020-09-24 18:09:51 -07:00
Kai Fricke d9c4dea7cf [tune] strict metric checking (#10972) 2020-09-24 10:00:48 -07:00
Amog Kamsetty f42ab54112 [Docs] [Tune] Fix Tune Quick Start docs (#10996) 2020-09-24 00:28:01 -07:00
Kai Fricke 50d63b8077 [tune] update pt tutorial docs (#10925)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-09-21 13:33:37 -07:00
Richard Liaw b0ca70f628 [tune+core] tune lifecycle and starting ray guide (#10813) 2020-09-21 11:27:50 -07:00
Kai Fricke 508cfa3540 [tune] Support yield and return statements (#10857)
* Support `yield` and `return` statements in Tune trainable functions

* Support anonymous metric with ``tune.report(value)``

* Raise on invalid return/yield value

* Fix end to end reporter test
2020-09-17 20:18:35 -07:00
Ameer Haj Ali 6edacb22b8 Fix abstraction violations in command_runner interface (#10715)
* Fix abstraction violations in command_runner interface

* user guide

* lint

* breaking abstraction in commands

* extra initialization commands

* more cleanup

* small fixes

* fix test_integration_kubernetes.py

* lint

Co-authored-by: root <root@ip-172-31-28-155.us-west-2.compute.internal>
Co-authored-by: Ameer Haj Ali <ameerhajali@Ameers-MacBook-Pro.local>
2020-09-14 20:28:38 -07:00