Simple Reinforcement Learning with Tensorflow Part 4: Deep Q-Networks and Beyond
Arthur Juliani

Hi, I tried modifying your implementation to make it work with Pong, but for some reason it's not learning. Would you mind looking at the code to see if I'm missing something obvious?
https://gist.github.com/parthsharma1996/7f5997e9e6435b9144b3f12a725f6fc5
