mirror of
https://github.com/wassname/ray.git
synced 2026-06-28 21:46:57 +08:00
995ac24a2c
It's possible to configure PPO in a way that ends up discarding most of the samples (they are treated as "stragglers"). Add a warning when this happens, and raise an exception if the waste is particularly egregious.