ray/doc/source
Eric Liang 995ac24a2c [rllib] clarify train batch size for PPO (#2793)
It's possible to configure PPO in a way that ends up discarding most of the samples (they are treated as "stragglers"). Add a warning when this happens, and raise an exception if the waste is particularly egregious.
2018-09-05 12:06:13 -07:00
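The check described in the commit message above can be sketched roughly as follows. This is only an illustration, not RLlib's code: the function name `check_sample_waste`, its parameters, and the two thresholds are all hypothetical, and the sketch assumes that any samples collected beyond the train batch size are the ones discarded.

```python
import warnings


def check_sample_waste(samples_collected, train_batch_size,
                       warn_fraction=0.1, error_fraction=0.5):
    """Return the fraction of collected samples that would be discarded.

    Hypothetical helper: the names and thresholds are assumptions made
    for illustration and do not come from RLlib.
    """
    if samples_collected <= 0:
        raise ValueError("samples_collected must be positive")

    # Assumption: samples beyond the train batch size are dropped.
    discarded = max(samples_collected - train_batch_size, 0)
    waste = discarded / samples_collected

    if waste > error_fraction:
        # Egregious waste: refuse to run with this configuration.
        raise ValueError(
            "{:.0%} of collected samples would be discarded; "
            "adjust the batch size settings".format(waste))
    if waste > warn_fraction:
        # Moderate waste: continue, but tell the user about it.
        warnings.warn(
            "{:.0%} of collected samples would be discarded".format(waste))
    return waste
```

With these assumed thresholds, collecting 1000 samples for a train batch of 800 only warns, while collecting 1000 for a train batch of 100 raises.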