mirror of
https://github.com/wassname/ray.git
synced 2026-06-28 03:34:48 +08:00
24649726dc
Using the actual batch size reduces the risk of mis-accounting. Here, we under-counted samples since in truncate_episodes mode we were doubling the batch size by accident in policy_evaluator.