310 Commits

Author SHA1 Message Date
Amog Kamsetty f95ab4f506 [Testing] Multi-node Training+Tune Long Running Test (#8966) 2020-06-22 14:49:16 -07:00
mehrdadn fc4684d3ca Update pandas to 1.0.5 (#9065)
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-21 14:35:25 -05:00
mehrdadn 981f67bfb0 Fix more Windows issues (#9011)
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-19 18:51:45 -07:00
Max Fitton ad09aa985c Make Dashboard Port Configurable (#8999) 2020-06-19 16:26:22 -05:00
mehrdadn 92f67cd2ae Add Optional Fast Build Configuration (#8925)
* Fast builds by default

* Update doc/source/development.rst

Co-authored-by: Simon Mo <xmo@berkeley.edu>

Co-authored-by: Mehrdad <noreply@github.com>
Co-authored-by: Simon Mo <xmo@berkeley.edu>
2020-06-18 14:12:12 -07:00
mehrdadn 003cec87b4 [CI] Fix Conda Permission on macOS GitHub Action (#9004)
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-18 00:00:20 -07:00
Ian Rodney 5208f8db12 [testing] Adding useful error messages for long_running_tests (#8960) 2020-06-17 18:32:35 -07:00
mehrdadn 4afa2b304a Clean up CI ASAN & .bazelrc (#8828) 2020-06-15 17:27:17 -07:00
Stephanie Wang 19d44d4fa9 Use no_restart=False for ray.kill in Serve failure test (#8952) 2020-06-15 15:34:56 -07:00
Richard Liaw 6c49c01837 [tune] Function API checkpointing (#8471)
Co-authored-by: krfricke <krfricke@users.noreply.github.com>
2020-06-15 10:42:54 -07:00
mehrdadn 101c215125 Get more tests running on Windows (#6537)
* Get rid of system() calls

* Work around '/usr/share/mini' showing up on GitHub Actions (probably due to psutil truncation)

https://github.com/ray-project/ray/runs/722480047?check_suite_focus=true

* Don't check for socket max path length on Windows

* Don't check for socket existence on Windows

* Fix race condition in Windows fate-sharing

* Work around missing .exe extension for Redis tests

* Add more tests to GitHub Actions

Co-authored-by: Mehrdad <noreply@github.com>
2020-06-12 21:32:10 -07:00
mehrdadn 07637e5b5b Upgrade Bazel and add required patches (#8847) 2020-06-12 14:59:22 -05:00
SangBin Cho 890cb8cb70 Node failure test fix (#8882) 2020-06-10 16:21:27 -05:00
mehrdadn 67cd9a0c95 CI fixes and improvements (#8849)
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-09 14:07:54 -07:00
SangBin Cho 2b3fa35fbe [Serve] Serve long running test fix (#8864) 2020-06-09 12:17:18 -05:00
Edward Oakes f8d20d25bd [serve] Fix long running failure test (#8863) 2020-06-09 12:13:20 -05:00
mehrdadn b8cc9a1cbb Change GitHub Actions path to one with spaces (#8850)
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-08 23:48:02 -07:00
Simon Mo 3b32cf14a7 Add release test running full asan python test (#8836) 2020-06-08 13:22:12 -07:00
Sven Mika c74dc58f8b [RLlib] Fix use_lstm flag for ModelV2 (w/o ModelV1 wrapping) and add it for PyTorch. (#8734) 2020-06-05 15:40:30 +02:00
Siyuan (Ryans) Zhuang ea05ebe89e Ship plasma store with Ray (#7901) 2020-06-03 17:44:34 -07:00
Max Fitton b9f0f7ae5b Dashboard minor refactor and first unit tests (#8705) 2020-06-03 11:04:55 -05:00
Alec Brickner 207ab44129 Raise major version limit for msgpack (#8466) 2020-06-01 20:00:36 -07:00
krfricke e5b6566d28 Remove blocking flag from serve.init() (#8654) 2020-05-29 13:25:35 -07:00
Patrick Ames 76450c8d47 [autoscaler] Honor separate head and worker node subnet IDs (#8374) 2020-05-28 18:16:46 -07:00
Simon Mo b93d6813ae Build from source in Jenkins (#8255) 2020-05-28 09:38:16 -07:00
SangBin Cho 448011f822 0.8.5 Release change. (#8358) 2020-05-28 09:37:19 -07:00
Lingxuan Zuo e594524ed3 [GCS] global state query node info table from GCS. (#8498) 2020-05-28 16:39:13 +08:00
mehrdadn 79a4eac48c Make more tests run on Windows (#8553) 2020-05-26 18:43:34 -05:00
Eric Liang bb8d3c5cd0 ASAN build for ray core tests (#8431) 2020-05-21 15:11:03 -07:00
Edward Oakes a76434ccde Add ability to specify worker and driver ports (#8071) 2020-05-20 15:31:13 -05:00
mehrdadn ebf060d484 Make more tests run on Windows (#8446)
* Remove worker Wait() call due to SIGCHLD being ignored

* Port _pid_alive to Windows

* Show PID as well as TID in glog

* Update TensorFlow version for Python 3.8 on Windows

* Handle missing Pillow on Windows

* Work around dm-tree PermissionError on Windows

* Fix some lint errors on Windows with Python 3.8

* Simplify torch requirements

* Quiet git clean

* Handle finalizer issues

* Exit with the signal number

* Get rid of wget

* Fix some Windows compatibility issues with tests

Co-authored-by: Mehrdad <noreply@github.com>
2020-05-20 12:25:04 -07:00
mehrdadn 8da084bc54 Try to address linting issues (#8485) 2020-05-19 10:29:17 -05:00
Sven Mika 796a834c48 [RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371) 2020-05-18 17:26:40 +02:00
SangBin Cho 1b734ba045 Pin sklearn version (#8465) 2020-05-15 16:54:54 -07:00
mehrdadn cd0037064c Windows wheels for multiple Python versions (#8369)
* Upload wheels to latest directory as well on GitHub Actions

* Fix bug in install-dependencies.sh

* Move out bazel build //:* from install_ray, since it isn't really necessary for that purpose

* Build wheels for different versions of Python on Windows

* Compile Windows in opt mode

Co-authored-by: Mehrdad <noreply@github.com>
2020-05-12 22:06:04 -07:00
Eric Liang 9d012626e5 [rllib] Distributed exec workflow for impala (#8321) 2020-05-11 20:24:43 -07:00
mehrdadn 66b3edccb9 Prefer built-in system compilers over Clang download (#8355)
Co-authored-by: Mehrdad <noreply@github.com>
2020-05-11 11:53:35 -05:00
Sven Mika 2b0817cbd3 [RLlib] Retry pip installs (after waiting n seconds) in install-dependencies.sh (#8354) 2020-05-07 17:39:35 +02:00
Simon Mo c5a5a5de89 [Serve] Refactor Metric System: Counter + Measure Support (#8114) 2020-05-06 17:44:02 -07:00
mehrdadn 4bdef78e2e Various CI fixes and cleanup (#8289) 2020-05-05 10:47:49 -07:00
Maksim Smolin c2acb7ffe2 [SGD] Add imagenet example CI (#8150) 2020-05-02 16:48:35 -07:00
mehrdadn ff68fb8c7c Try to fix tests running all the time (#8280)
Co-authored-by: Mehrdad <noreply@github.com>
2020-05-02 15:37:52 -05:00
Edward Oakes 22cab930cd Retry actor failures in serve failure test (#8282) 2020-05-02 10:19:44 -05:00
Edward Oakes 019030cb4d Add long-running serve failure test (#8277) 2020-05-01 21:07:14 -05:00
mehrdadn bf074073e7 Deploy Windows wheels to Amazon S3 (#8237)
* Deploy to Amazon S3

* Install specifically requested Python version

Co-authored-by: Mehrdad <noreply@github.com>
2020-05-01 14:08:57 -07:00
Edward Oakes 13f718846d [serve] Always use internal KV store (#8270) 2020-05-01 14:18:18 -05:00
Edward Oakes 421b3c9d8b Fix serve long running test (#8268) 2020-05-01 11:54:27 -05:00
mehrdadn 254b1ec370 Set up testing and wheels for Windows on GitHub Actions (#8131)
* Move some Java tests into ci.sh

* Move C++ worker tests into ci.sh

* Define run()

* Prepare to move Python tests into ci.sh

* Fix issues in install-dependencies.sh

* Reload environment for GitHub Actions

* Move wheels to ci.sh and fix related issues

* Don't bypass failures in install-ray.sh anymore

* Make CI a little quieter

* Move linting into ci.sh

* Add vitals test right after build

* Fix os.uname() unavailability on Windows

Co-authored-by: Mehrdad <noreply@github.com>
2020-04-29 21:19:02 -07:00
Simon Mo 1b1fe0cc5b Fix Serve long running test (#8223) 2020-04-29 09:32:39 -07:00
Sven Mika eb91619175 Fix release 0.8.5 tests for PPO torch Breakout. (#8226) 2020-04-29 10:36:41 +02:00