wassname/ray

mirror of https://github.com/wassname/ray.git synced 2026-07-04 08:11:44 +08:00

Sven Mika 62c7ab5182 [RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)

2020-11-12 16:27:34 +01:00

File | Last commit | Date
__init__.py | [RLlib] Examples folder restructuring (models) part 1 (#8353) | 2020-05-08 08:20:18 +02:00
pg_tf_policy.py | [RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420) | 2020-09-02 14:03:01 +02:00
pg_torch_policy.py | [RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747) | 2020-11-12 16:27:34 +01:00
pg.py | [RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747) | 2020-11-12 16:27:34 +01:00
README.md | [docs] Move all /latest links to /master (#11897) | 2020-11-10 10:53:28 -08:00
utils.py | [RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115) | 2020-08-20 17:05:57 +02:00

README.md

Policy Gradient (PG)

An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.
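
For readers unfamiliar with the term, a vanilla policy gradient update can be sketched roughly as follows. This is a minimal, framework-free illustration and not the code in `pg_tf_policy.py` or `pg_torch_policy.py`. The function names (`discounted_returns`, `softmax`, `pg_loss`), the discrete action space, and the choice of discounted returns-to-go as the weighting term are all assumptions made for this example.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Return-to-go G_t = r_t + gamma * G_{t+1} for a single episode."""
    returns = []
    running = 0.0
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def softmax(logits):
    """Numerically stable softmax over a list of action logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def pg_loss(logits_per_step, actions, rewards, gamma=0.99):
    """Surrogate loss: -mean_t( log pi(a_t | s_t) * G_t ).

    Minimizing this value with respect to the policy parameters
    follows the (hypothetical, illustrative) policy gradient estimate.
    """
    returns = discounted_returns(rewards, gamma)
    total = 0.0
    for logits, action, g in zip(logits_per_step, actions, returns):
        log_prob = math.log(softmax(logits)[action])
        total += log_prob * g
    return -total / len(returns)
```

In a real TensorFlow or PyTorch policy the logits would come from a neural network and the loss would be differentiated automatically; here the loss is only evaluated, which is enough to show which quantities enter it.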

Detailed Documentation

Implementation