Files
ray/python
Eric Liang 24649726dc [rllib] Use batch.count in async samples optimizer (#2488)
Using the actual batch size reduces the risk of mis-accounting. Here, we under-counted samples since in truncate_episodes mode we were doubling the batch size by accident in policy_evaluator.
2018-07-27 16:44:21 -07:00
..
2018-06-20 10:43:44 -07:00
2018-07-25 14:57:28 -07:00
2018-07-12 19:12:04 +02:00