mirror of
https://github.com/wassname/ray.git
Policy Gradient (PG)
An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.
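
For orientation, here is a minimal sketch of the quantity a vanilla policy gradient (REINFORCE-style) method optimizes: the negative mean of `log pi(a_t | s_t) * G_t`, where `G_t` is the discounted return-to-go. This is not the code in this directory. The function names (`discounted_returns`, `softmax`, `pg_loss`), the default `gamma=0.99`, and the plain-list inputs are all assumptions made for illustration, and the sketch uses only the Python standard library instead of TensorFlow or PyTorch.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for each step of one episode."""
    returns = []
    running = 0.0
    # Walk the episode backwards so each step can reuse the next step's return.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def softmax(logits):
    """Numerically stable softmax over a list of action logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def pg_loss(logits_per_step, actions, rewards, gamma=0.99):
    """Surrogate loss -mean_t(log pi(a_t | s_t) * G_t) for one episode.

    logits_per_step: one list of action logits per time step (hypothetical
    stand-in for the output of a policy network).
    actions: index of the action taken at each step.
    rewards: reward received at each step.
    """
    returns = discounted_returns(rewards, gamma)
    total = 0.0
    for logits, action, g in zip(logits_per_step, actions, returns):
        log_prob = math.log(softmax(logits)[action])
        total += log_prob * g
    return -total / len(rewards)
```

In a TensorFlow or PyTorch implementation, the same loss would be built from tensors so that automatic differentiation yields the policy gradient. The version above only evaluates the loss value.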