Simple Reinforcement Learning with Tensorflow Part 4: Deep Q-Networks and Beyond
Arthur Juliani

Hi, I tried modifying your implementation to make it work with pong, but for some reason it’s not learning. Care to look at the code and see if I am missing something obvious?

One clap, two clap, three clap, forty?

By clapping more or less, you can signal to us which stories really stand out.