mirror of
https://github.com/wassname/ray.git
synced 2026-07-05 01:59:42 +08:00
946ebfaa3c
* Validate that entropy coeff is not an integer Passing an integer value for entropy coeff such as 0 raises an error somewhere inside the TF policy graph, so this checks to make sure the entropy coeff is a float. * Cast to float instead Also move this check after the negative value check