ray/python at 5f430da18075878fbefd7b9c33cc22bb65710d9d - ray

mirror of https://github.com/wassname/ray.git synced 2026-06-28 08:55:51 +08:00

Eric Liang 5f430da180 [rllib] Provide internal access to episode state in compute_actions() and allow returning extra batches (#2559)

The goal of this PR is to allow custom policies to perform model-based rollouts. In the multi-agent setting, this requires access not only to the policies of other agents but also to their current observations.
For efficiency, a policy may also want to return the model-based trajectories as part of the rollout.

  Add a new keyword arg, episodes, to compute_actions()
  Pull the internal episode class out into a top-level file
  Add a function to return extra trajectories from an episode, which are appended to the sample batch
  Add documentation
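
The changes listed above could be pictured with a minimal sketch along the following lines. It does not import RLlib and is not the PR's implementation: `ToyEpisode`, `ToyModelBasedPolicy`, `last_observations`, and `add_extra_batch` are hypothetical names chosen for illustration, and the `(actions, state_outs, info)` return shape is an assumption. Only the `episodes` keyword argument on `compute_actions()` comes from the commit message.

```python
import random


class ToyEpisode:
    """Hypothetical stand-in for the internal episode object."""

    def __init__(self, last_observations):
        # Maps agent id -> that agent's most recent observation.
        self.last_observations = dict(last_observations)
        # Extra trajectories queued to be appended to the sample batch.
        self.extra_batches = []

    def add_extra_batch(self, batch):
        # Hypothetical hook for returning extra trajectories.
        self.extra_batches.append(batch)


class ToyModelBasedPolicy:
    """Hypothetical policy that reads episode state while choosing actions."""

    def __init__(self, num_actions=2, seed=0):
        self.num_actions = num_actions
        self.rng = random.Random(seed)

    def compute_actions(self, obs_batch, state_batches=None, episodes=None):
        # Pick one (random) action per observation in the batch.
        actions = [self.rng.randrange(self.num_actions) for _ in obs_batch]
        if episodes is not None:
            # One episode per observation: look at the other agents'
            # observations and queue a one-step "model-based" trajectory.
            for obs, action, episode in zip(obs_batch, actions, episodes):
                episode.add_extra_batch({
                    "obs": [obs],
                    "actions": [action],
                    "other_agent_obs": [dict(episode.last_observations)],
                })
        # Assumed return shape: actions, recurrent state outputs, extra info.
        return actions, [], {}


episode = ToyEpisode({"agent_1": [0.1, 0.2]})
policy = ToyModelBasedPolicy()
actions, state_outs, info = policy.compute_actions([[0.0, 1.0]], episodes=[episode])
```

Keeping `episodes` optional means callers that do not pass it are unaffected, which matches the commit's description of it as a new keyword argument.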

2018-08-16 14:37:21 -07:00

benchmarks

[asv] Add benchmark for ray.wait (#2625)

2018-08-10 17:52:36 -07:00

ray

[rllib] Provide internal access to episode state in compute_actions() and allow returning extra batches (#2559)

2018-08-16 14:37:21 -07:00

asv.conf.json

[asv] Pushing to s3 (#2246)

2018-06-20 10:43:44 -07:00

build-wheel-macos.sh

Fix MAC_WHEELS=1 (#2477)