mirror of
https://github.com/wassname/ray.git
synced 2026-06-28 03:34:48 +08:00
68660453e4
This adds a simple DQN+PPO example for multi-agent. We don't do anything fancy here, just syncing weights between two separate trainers. This potentially is wasting some compute, but is very simple to set up. It might be nice to share experience collection between the top-level trainers in the future.