Eric Liang
|
ecb811c26e
|
[rllib] Ape-X implementation and DQN refactor to handle replay in policy optimizer (#1604)
* minimal Ape-X checkin
* clean up DQN options
* actor utils
* update
* compression refactor
* fix
* add test
* fix models
* refactor
* make optimizer construction explicit
|
2018-03-04 12:25:25 -08:00 |
|