Files
ray/doc/source
Sven Mika d0fab84e4d [RLlib] DDPG PyTorch version. (#7953)
The DDPG/TD3 algorithms currently do not have a PyTorch implementation. This PR adds PyTorch support for DDPG/TD3 to RLlib.
This PR:
- Depends on the re-factor PR for DDPG (Functional Algorithm API).
- Adds learning regression tests for the PyTorch version of DDPG and a DDPG (torch)
- Updates the documentation to reflect that DDPG and TD3 now support PyTorch.

* Learning Pendulum-v0 on torch version (same config as tf). Wall time a little slower (~20% than tf).
* Fix GPU target model problem.
2020-04-16 10:20:01 +02:00
..
2020-04-15 12:25:37 -07:00
2020-04-15 12:25:37 -07:00
2018-07-01 00:05:08 -07:00
2019-11-19 16:15:08 -08:00
2018-07-01 00:05:08 -07:00
2020-04-15 12:25:37 -07:00
2019-04-09 20:59:17 -07:00
2018-07-01 00:05:08 -07:00
2019-12-17 15:56:50 -08:00
2018-01-25 16:39:00 -08:00