mirror of
https://github.com/wassname/pytorch-a2c-ppo-acktr.git
synced 2026-06-27 16:20:05 +08:00
f09b3a75e42c36c4ae8cc041c75c9cf7d8fbd2ae
pytorch-a2c
This is a PyTorch implementation of Advantage Actor Critic (A2C), a synchronous deterministic version of A3C "Asynchronous Methods for Deep Reinforcement Learning". Also see the OpenAI post (section A2C and A3C) for more information.
This implementation is inspired by the OpenAI A2C baseline. It uses the same hyper parameters and the model since they were well tuned for Atari games.
Contributions
Contributions are very welcome. If you know how to make this code better, don't hesitate to send a pull request.
Usage
python main.py --env-name "PongNoFrameskip-v4"
Results
More results coming soon.
Languages
Python
100%



