I'm trying a dynamics model to provide additional supervision. I'm using this repo because it's performance tested on a competition and is in pytorch. I'm intially testing with pendulum. Code is messy as it's a one time experiment.