mirror of
https://github.com/wassname/ray.git
synced 2026-06-27 23:08:32 +08:00
995ac24a2c
It's possible to configure PPO in a way that ends up discarding most of the samples (they are treated as "stragglers"). Add a warning when this happens, and raise an exception if the waste is particularly egregious.