Logo
Explore Help
Register Sign In
wassname/ray
1
0
Fork 0
You've already forked ray
mirror of https://github.com/wassname/ray.git synced 2026-07-04 08:11:44 +08:00
Code Issues Packages Projects Releases Wiki Activity
Files
8fb926565cbf0cd554bc24ccf1ea3a47dcc41dad
ray/rllib/agents/pg
T
History
Sven Mika 62c7ab5182 [RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)
2020-11-12 16:27:34 +01:00
..
tests
[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)
2020-11-12 16:27:34 +01:00
__init__.py
[RLlib] Examples folder restructuring (models) part 1 (#8353)
2020-05-08 08:20:18 +02:00
pg_tf_policy.py
[RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420)
2020-09-02 14:03:01 +02:00
pg_torch_policy.py
[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)
2020-11-12 16:27:34 +01:00
pg.py
[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)
2020-11-12 16:27:34 +01:00
README.md
[docs] Move all /latest links to /master (#11897)
2020-11-10 10:53:28 -08:00
utils.py
[RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115)
2020-08-20 17:05:57 +02:00

README.md

Policy Gradient (PG)

An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.

Detailed Documentation

Implementation

Reference in New Issue View Git Blame Copy Permalink
Powered by Gitea Version: 1.26.4 Page: 129ms Template: 2ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API