Proximal Policy Optimization (PPO) is one of the leading Reinforcement Learning (RL) algorithms. PPO is…
OpenAI’s retro gym is a great tool for using Reinforcement Learning (RL) algorithms on classic…
In an earlier post, I wrote about a naive way to use human demonstrations to help train a Deep-Q Network (DQN) for Sonic the Hedgehog. After that mostly unsuccessful attempt I read an interesting paper called Deep Q-learning from…
See my prior blog post for an intro to this and this repo for the files I’ll be discussing.
The University of California Deep Reinforcement Learning (RL) course has a lecture by John…
I’ve been attempting to clear all levels in the original Sonic the Hedgehog for Genesis using OpenAI’s retro gym to train a Reinforcement Learning (RL) agent. I’ll have more on that in some upcoming posts. In this post and…
OpenAI held a Retro Contest where competitors trained Reinforcement Learning (RL) agents on Sonic the Hedgehog. The goal of the competition was to train an agent on levels of Sonic from the first three games and see…
I have been playing around with OpenAI’s Retro, which allows you to use gym with old Sega Genesis and Nintendo games. I’ve been doing some experiments on the original Sonic the Hedgehog on my home computer and some cloud instances. I used Google Cloud and Floydhub.com…
The retro-movies repo makes it easy to create human demonstrations for retro gym. I made a script to turn a human demonstrations into frame by frame transitions. Some slight modifications to the Rainbow DQN baseline provided…