Logo
Explore Help
Register Sign In
wassname/ray
1
0
Fork 0
You've already forked ray
mirror of https://github.com/wassname/ray.git synced 2026-06-29 19:17:01 +08:00
Code Issues Packages Projects Releases Wiki Activity
Files
54215ff287c034fe17c2aa1de5a8a67dc008dca1
ray/rllib/agents/pg
T
History
Sven Mika ef18893fb5 [RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420)
2020-09-02 14:03:01 +02:00
..
tests
ci: Redo format.sh --all script & backfill lint fixes (#9956)
2020-08-07 16:49:49 -07:00
__init__.py
[RLlib] Examples folder restructuring (models) part 1 (#8353)
2020-05-08 08:20:18 +02:00
pg_tf_policy.py
[RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420)
2020-09-02 14:03:01 +02:00
pg_torch_policy.py
[RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115)
2020-08-20 17:05:57 +02:00
pg.py
[RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115)
2020-08-20 17:05:57 +02:00
README.md
[RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115)
2020-08-20 17:05:57 +02:00
utils.py
[RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115)
2020-08-20 17:05:57 +02:00

README.md

Policy Gradient (PG)

An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.

Detailed Documentation

Implementation

Reference in New Issue View Git Blame Copy Permalink
Powered by Gitea Version: 1.26.4 Page: 41ms Template: 2ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API