wassname/ray: An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library. - ray

mirror of https://github.com/wassname/ray.git synced 2026-07-04 13:05:25 +08:00

T

Eric Liang b6c42f96be Auto-scale ray clusters based on GCS load metrics (#1348 )

This adds (experimental) auto-scaling support for Ray clusters based on GCS load metrics. The auto-scaling algorithm is as follows:

Based on current (instantaneous) load information, we compute the approximate number of "used workers". This is based on the bottleneck resource, e.g. if 8/8 GPUs are used in a 8-node cluster but all the CPUs are idle, the number of used nodes is still counted as 8. This number can also be fractional.
We scale that number by 1 / target_utilization_fraction and round up to determine the target cluster size (subject to the max_workers constraint). The autoscaler control loop takes care of launching new nodes until the target cluster size is met.
When a node is idle for more than idle_timeout_minutes, we remove it from the cluster if that would not drop the cluster size below min_workers.
Note that we'll need to update the wheel in the example yaml file after this PR is merged.

2017-12-31 14:39:57 -08:00

.github

Add docs for contributors. (#1191 )

2017-11-10 00:40:19 -08:00

.travis

Add a distributed Dataframe API to Ray (#1330 )

2017-12-20 09:31:22 -08:00

cmake/Modules

Second Part of Internal API Refactor (#1326 )

2017-12-26 16:22:04 -08:00

doc

[rllib] [tune] Custom preprocessors and models, various fixes (#1372 )

2017-12-28 13:19:04 -08:00

docker

Fixing the jenkins tests (#1299 )

2017-12-07 17:03:58 -08:00

examples

[carla] In carla example, save all images and measurements to local disk (#1350 )

2017-12-21 15:19:55 -08:00

python

Auto-scale ray clusters based on GCS load metrics (#1348 )

2017-12-31 14:39:57 -08:00

site

0.3 release blog post. (#1274 )

2017-11-30 16:24:34 -08:00

src

Update arrow, and pass memcopy_threads into put. (#1374 )

2017-12-31 13:32:06 -08:00

test

Auto-scale ray clusters based on GCS load metrics (#1348 )

2017-12-31 14:39:57 -08:00

vsprojects

Windows compatibility (#57 )

2016-11-22 17:04:24 -08:00

.clang-format

Implement object table notification subscriptions and switch to using Redis modules for object table. (#134 )

2016-12-18 18:19:02 -08:00

.editorconfig

Update Windows support (#317 )

2016-07-28 13:11:13 -07:00

.gitignore

[docs] Add backlinks from hyperopt / rl algorithm examples to the built-on Ray libraries (#1356 )

2017-12-23 00:31:33 -08:00

.style.yapf

Make Monitor remove dead Redis entries from exiting drivers. (#994 )

2017-09-26 00:11:38 -07:00

.travis.yml

[rllib] Evaluators and Optimizers Refactoring (#1339 )

2017-12-30 00:24:54 -08:00

build-docker.sh

adding -x flag for better debugging during builds (#1079 )

2017-10-04 13:56:14 -07:00

build.sh

Changes to build to fix creation of wheels. (#840 )

2017-08-21 17:49:35 -07:00

CMakeLists.txt

Second Part of Internal API Refactor (#1326 )

2017-12-26 16:22:04 -08:00

CONTRIBUTING.rst

Add docs for contributors. (#1191 )

2017-11-10 00:40:19 -08:00

LICENSE

[rllib] Basic port of baselines/deepq to rllib (#709 )

2017-07-07 18:37:00 +00:00

pylintrc

adding pylint (#233 )

2016-07-08 12:39:11 -07:00

Ray.sln

Windows compatibility (#57 )

2016-11-22 17:04:24 -08:00

README.rst

[rllib] Update RLlib docs and README (#1288 )

2017-12-06 18:17:51 -08:00

README.rst

Ray
===

.. image:: https://travis-ci.org/ray-project/ray.svg?branch=master
    :target: https://travis-ci.org/ray-project/ray

.. image:: https://readthedocs.org/projects/ray/badge/?version=latest
    :target: http://ray.readthedocs.io/en/latest/?badge=latest

|

Ray is a flexible, high-performance distributed execution framework.

Ray comes with libraries that accelerate deep learning and reinforcement learning development:

- `Ray.tune`_: Efficient Distributed Hyperparameter Search
- `Ray RLlib`_: A Composable and Scalable Reinforcement Learning Library

.. _`Ray.tune`: http://ray.readthedocs.io/en/latest/tune.html
.. _`Ray RLlib`: http://ray.readthedocs.io/en/latest/rllib.html


Installation
------------

- Ray can be installed on Linux and Mac with ``pip install ray``.
- To build Ray from source, see the instructions for `Ubuntu`_ and `Mac`_.

.. _`Ubuntu`: http://ray.readthedocs.io/en/latest/install-on-ubuntu.html
.. _`Mac`: http://ray.readthedocs.io/en/latest/install-on-macosx.html


Example Program
---------------

+------------------------------------------------+----------------------------------------------+
| **Basic Python**                               | **Distributed with Ray**                     |
+------------------------------------------------+----------------------------------------------+
|.. code:: python                                |.. code-block:: python                        |
|                                                |                                              |
|  import time                                   |  import time                                 |
|                                                |  import ray                                  |
|                                                |                                              |
|                                                |  ray.init()                                  |
|                                                |                                              |
|                                                |  @ray.remote                                 |
|  def f():                                      |  def f():                                    |
|      time.sleep(1)                             |      time.sleep(1)                           |
|      return 1                                  |      return 1                                |
|                                                |                                              |
|  # Execute f serially.                         |  # Execute f in parallel.                    |
|  results = [f() for i in range(4)]             |  object_ids = [f.remote() for i in range(4)] |
|                                                |  results = ray.get(object_ids)               |
+------------------------------------------------+----------------------------------------------+


More Information
----------------

- `Documentation`_
- `Tutorial`_
- `Blog`_
- `Ray HotOS paper`_

.. _`Documentation`: http://ray.readthedocs.io/en/latest/index.html
.. _`Tutorial`: https://github.com/ray-project/tutorial
.. _`Blog`: https://ray-project.github.io/
.. _`Ray HotOS paper`: https://arxiv.org/abs/1703.03924

Getting Involved
----------------

- Ask questions on our mailing list `ray-dev@googlegroups.com`_.
- Please report bugs by submitting a `GitHub issue`_.
- Submit contributions using `pull requests`_.

.. _`ray-dev@googlegroups.com`: https://groups.google.com/forum/#!forum/ray-dev
.. _`GitHub issue`: https://github.com/ray-project/ray/issues
.. _`pull requests`: https://github.com/ray-project/ray/pulls

Description

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Readme Multiple Licenses 111 MiB

Languages

Python 56.6%

C++ 28.8%

Java 8.5%

TypeScript 1.7%

Starlark 1.4%

Other 2.8%