ray/python at 0b6112b726e10a66e63f44f1508e439aecf01faa - ray

mirror of https://github.com/wassname/ray.git synced 2026-06-28 04:23:03 +08:00

Files

T

Eric Liang 0b6112b726 [rllib] Part 1 of multiagent support: make sampler path support multiagent envs (#2268 )

This refactors the RLlib sampler to support multi-agent environments. The main changes were:

AsyncVectorEnv now produces dicts of env_id -> agent_id -> value rather than env_id -> value. This lets it model both vectorized and multi-agent envs (or both).
The sampler class operates over the above nested dict structure for all envs. Single agent envs just return a dict with one agent_id=single_agent.
When sample() is called on a policy evaluator, in the single agent case we return a SampleBatch, otherwise we return a MultiAgentBatch (which is a list of sample batches per policy).
Left for another PR:

Exposing multi-agent in the public interfaces.
Optimizations such as evaluating multiple policies in one TF run.

2018-06-23 18:32:16 -07:00

benchmarks

Add Docker Support for ASV (#2184 )

2018-06-05 15:55:35 -07:00

ray

[rllib] Part 1 of multiagent support: make sampler path support multiagent envs (#2268 )

2018-06-23 18:32:16 -07:00

asv.conf.json

[asv] Pushing to s3 (#2246 )

2018-06-20 10:43:44 -07:00

build-wheel-macos.sh

Incorporate C++ Buffer management and Seal global threadpool fix from arrow (#1950 )