ray/python
Eric Liang 995ac24a2c [rllib] clarify train batch size for PPO (#2793)
It's possible to configure PPO in a way that ends up discarding most of the samples (they are treated as "stragglers"). Add a warning when this happens, and raise an exception if the waste is particularly egregious.
2018-09-05 12:06:13 -07:00
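The check described in the commit message could look roughly like the sketch below. This is not the RLlib code from the commit. The function name `check_sample_waste`, its parameters, and both thresholds are hypothetical, and the sketch assumes that any samples collected beyond the train batch size are the ones discarded.

```python
import warnings


def check_sample_waste(collected, train_batch_size,
                       warn_above=0.5, fail_above=0.9):
    """Return the fraction of collected samples that would be discarded.

    Hypothetical helper: the names and the 0.5 / 0.9 thresholds are
    illustrative assumptions, not values taken from RLlib.
    """
    if collected <= 0:
        raise ValueError("collected must be positive")

    # Assumption: only up to train_batch_size samples are used per step;
    # the remainder are treated as stragglers and dropped.
    used = min(collected, train_batch_size)
    wasted = 1.0 - used / collected

    if wasted > fail_above:
        # Egregious waste: refuse to run with this configuration.
        raise ValueError(
            "{:.0%} of samples would be discarded; "
            "increase train_batch_size or collect fewer samples".format(wasted))
    if wasted > warn_above:
        # Noticeable waste: let the run continue but tell the user.
        warnings.warn(
            "{:.0%} of samples would be discarded as stragglers".format(wasted))
    return wasted
```

With these assumed thresholds, collecting 4000 samples for a train batch size of 1000 would warn, while collecting 100000 would raise.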