mirror of
https://github.com/wassname/ray.git
synced 2026-06-28 06:31:33 +08:00
74fd3d7e21
* Cleaner and more correct treatment of agent states in rollout.py * support lstm_use_prev_action_reward in rollout.py * Linter. * appease flake8 * Use _DUMMY_AGENT_ID instead of 0. * All agents have a policy_agent_mapping. Reset the mapping cache at the start of each episode. * Update rollout.py * Fix rollout.py for single-agent envs. * Use agent_id, not policy_id.