wassname/ray: An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library. - ray

mirror of https://github.com/wassname/ray.git synced 2026-07-03 08:53:53 +08:00

T

Sam Toyer 663e92ab3f [rllib] TD3/DDPG improvements and MuJoCo benchmarks (#4694 )

* [rllib] Separate optimisers for DDPG actor & crit.

* [rllib] Better names for DDPG variables & options

Config changes:

- noise_scale -> exploration_ou_noise_scale
- exploration_theta -> exploration_ou_theta
- exploration_sigma -> exploration_ou_sigma
- act_noise -> exploration_gaussian_sigma
- noise_clip -> target_noise_clip

* [rllib] Make DDPG less class-y

Used functions to replace three classes with only an __init__ method & a
handful of unrelated attributes.

* [rllib] Refactor DDPG noise

* [rllib] Unify DDPG exploration annealing

Added option "exploration_should_anneal" to enable linear annealing of
exploration noise. By default this is off, for consistency with DDPG &
TD3 papers. Also renamed "exploration_final_eps" to
"exploration_final_scale" (that name seems to have been carried over
from DQN, and doesn't really make sense here). Finally, tried to rename
"eps" to "noise_scale" wherever possible.

2019-04-26 17:49:53 -07:00

.github

Add lint advisory to PR template

2019-03-29 16:49:02 -07:00

bazel

[Bazel] Use rules_jvm_external to manage java dependencies (#4615 )

2019-04-18 16:53:25 +08:00

Update Release Process documentation (#4670 )

2019-04-25 00:05:19 -07:00

dev

Update Release Process documentation (#4670 )

2019-04-25 00:05:19 -07:00

doc

[rllib] TD3/DDPG improvements and MuJoCo benchmarks (#4694 )

2019-04-26 17:49:53 -07:00

docker

Remove CMake files (#4493 )

2019-04-02 22:17:33 -07:00

examples

Enforce quoting style in Travis. (#4589 )

2019-04-11 14:24:26 -07:00

java

Refactor command line argument parsing with gflags (#4676 )

2019-04-24 14:53:07 +08:00

kubernetes

fixed typo in kuber yaml (#4582 )

2019-04-08 23:13:42 -07:00

python

[rllib] TD3/DDPG improvements and MuJoCo benchmarks (#4694 )

2019-04-26 17:49:53 -07:00

site

Update Gemfile Jekyll version (#3140 )

2018-10-25 21:43:08 -07:00

src/ray

Integrate metric items into raylet (#4602 )

2019-04-25 11:40:24 +08:00

thirdparty/scripts

Remove the old web UI (#4301 )

2019-03-07 23:15:11 -08:00

tools

Integrate metrics (#4246 )

2019-04-02 21:01:02 -07:00

.bazelrc

Avoid redundant bazel build (#4458 )

2019-03-23 10:44:11 +08:00

.clang-format

Remove legacy Ray code. (#3121 )

2018-10-26 13:36:58 -07:00

.gitignore

[Bazel] Use rules_jvm_external to manage java dependencies (#4615 )

2019-04-18 16:53:25 +08:00

.style.yapf

YAPF, take 3 (#2098 )

2018-05-19 16:07:28 -07:00

.travis.yml

Fixe flakequotes to allow escaping quotes (#4666 )

2019-04-19 13:55:20 -07:00

build-docker.sh

adding -x flag for better debugging during builds (#1079 )

2017-10-04 13:56:14 -07:00

BUILD.bazel

Refactor command line argument parsing with gflags (#4676 )

2019-04-24 14:53:07 +08:00

build.sh

[Bazel] Use rules_jvm_external to manage java dependencies (#4615 )

2019-04-18 16:53:25 +08:00

CONTRIBUTING.rst

Direct people to stackoverflow for questions about usage. (#3830 )

2019-01-23 13:30:02 -08:00

LICENSE

[rllib] add augmented random search (#2714 )

2018-08-24 22:20:02 -07:00

pylintrc

adding pylint (#233 )

2016-07-08 12:39:11 -07:00

README.rst

Bump version to 0.7.0dev3 (#4671 )

2019-04-19 17:06:14 -07:00

scripts

Lint script link broken, also lint filter was broken for generated py files (#4133 )

2019-02-22 17:33:08 -08:00

setup_thirdparty.sh

update ray cmake build process (#2853 )

2018-09-12 11:19:33 -07:00

WORKSPACE

use an alternative boost mirror (#4685 )

2019-04-23 11:33:22 -07:00

README.rst

.. image:: https://github.com/ray-project/ray/raw/master/doc/source/images/ray_header_logo.png

.. image:: https://travis-ci.com/ray-project/ray.svg?branch=master
    :target: https://travis-ci.com/ray-project/ray

.. image:: https://readthedocs.org/projects/ray/badge/?version=latest
    :target: http://ray.readthedocs.io/en/latest/?badge=latest

.. image:: https://img.shields.io/badge/pypi-0.6.6-blue.svg
    :target: https://pypi.org/project/ray/

|

**Ray is a flexible, high-performance distributed execution framework.**


Ray is easy to install: ``pip install ray``

Example Use
-----------

+------------------------------------------------+----------------------------------------------------+
| **Basic Python**                               | **Distributed with Ray**                           |
+------------------------------------------------+----------------------------------------------------+
|.. code-block:: python                          |.. code-block:: python                              |
|                                                |                                                    |
|  # Execute f serially.                         |  # Execute f in parallel.                          |
|                                                |                                                    |
|                                                |  @ray.remote                                       |
|  def f():                                      |  def f():                                          |
|      time.sleep(1)                             |      time.sleep(1)                                 |
|      return 1                                  |      return 1                                      |
|                                                |                                                    |
|                                                |                                                    |
|                                                |  ray.init()                                        |
|  results = [f() for i in range(4)]             |  results = ray.get([f.remote() for i in range(4)]) |
+------------------------------------------------+----------------------------------------------------+


Ray comes with libraries that accelerate deep learning and reinforcement learning development:

- `Tune`_: Hyperparameter Optimization Framework
- `RLlib`_: Scalable Reinforcement Learning
- `Distributed Training <http://ray.readthedocs.io/en/latest/distributed_sgd.html>`__

.. _`Tune`: http://ray.readthedocs.io/en/latest/tune.html
.. _`RLlib`: http://ray.readthedocs.io/en/latest/rllib.html

Installation
------------

Ray can be installed on Linux and Mac with ``pip install ray``.

To build Ray from source or to install the nightly versions, see the `installation documentation`_.

.. _`installation documentation`: http://ray.readthedocs.io/en/latest/installation.html

More Information
----------------

- `Documentation`_
- `Tutorial`_
- `Blog`_
- `Ray paper`_
- `Ray HotOS paper`_

.. _`Documentation`: http://ray.readthedocs.io/en/latest/index.html
.. _`Tutorial`: https://github.com/ray-project/tutorial
.. _`Blog`: https://ray-project.github.io/
.. _`Ray paper`: https://arxiv.org/abs/1712.05889
.. _`Ray HotOS paper`: https://arxiv.org/abs/1703.03924

Getting Involved
----------------

- `ray-dev@googlegroups.com`_: For discussions about development or any general
  questions.
- `StackOverflow`_: For questions about how to use Ray.
- `GitHub Issues`_: For reporting bugs and feature requests.
- `Pull Requests`_: For submitting code contributions.

.. _`ray-dev@googlegroups.com`: https://groups.google.com/forum/#!forum/ray-dev
.. _`GitHub Issues`: https://github.com/ray-project/ray/issues
.. _`StackOverflow`: https://stackoverflow.com/questions/tagged/ray
.. _`Pull Requests`: https://github.com/ray-project/ray/pulls

Description

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Readme Multiple Licenses 111 MiB

Languages

Python 56.6%

C++ 28.8%

Java 8.5%

TypeScript 1.7%

Starlark 1.4%

Other 2.8%