mirror of https://github.com/wassname/ray.git synced 2026-07-03 13:27:59 +08:00

Files

T

Eric Liang 9b8218aabd [docs] Move all /latest links to /master (#11897 )

* use master link

* remae

* revert non-ray

* more

* mre

2020-11-10 10:53:28 -08:00

Policy Gradient (PG)

An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.