Commit Graph

5 Commits

Author SHA1 Message Date
wassname 6bb9c51403 add share normalizer for state and reward 2018-01-21 12:35:13 +08:00
wassname 66d6a74093 add logger 2018-01-19 13:05:32 +08:00
wassname a87a3ad7bb tidy 2018-01-18 17:23:35 +08:00
wassname 0de68133cf ddpg with a dynamics model
I'm trying a dynamics model to provide additional supervision. I'm using
this repo because it's performance tested on a competition and is in
pytorch. I'm intially testing with pendulum. Code is messy as it's a one
time experiment.
2018-01-18 16:41:20 +08:00
Kolesnikov Sergey 7401266fe7 pytorch version 2017-11-15 22:18:46 +03:00