Logo
Explore Help
Register Sign In
wassname/ray
1
0
Fork 0
You've already forked ray
mirror of https://github.com/wassname/ray.git synced 2026-06-28 04:23:03 +08:00
Code Issues Packages Projects Releases Wiki Activity
Files
python3.9
ray/rllib/agents/pg
T
History
Sven Mika 99ae7bae05 [RLlib] JAXPolicy prep. PR #1. (#13077)
2020-12-26 20:14:18 -05:00
..
tests
[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)
2020-11-12 16:27:34 +01:00
__init__.py
[RLlib] Examples folder restructuring (models) part 1 (#8353)
2020-05-08 08:20:18 +02:00
pg_tf_policy.py
[RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420)
2020-09-02 14:03:01 +02:00
pg_torch_policy.py
[RLlib] JAXPolicy prep. PR #1. (#13077)
2020-12-26 20:14:18 -05:00
pg.py
[RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366)
2020-12-01 17:41:10 -08:00
README.md
[docs] Move all /latest links to /master (#11897)
2020-11-10 10:53:28 -08:00
utils.py
[RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115)
2020-08-20 17:05:57 +02:00

README.md

Policy Gradient (PG)

An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.

Detailed Documentation

Implementation

Reference in New Issue View Git Blame Copy Permalink
Powered by Gitea Version: 1.26.4 Page: 58ms Template: 2ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API