Eq 1. Q(s,a) = r + γ(max(Q(s’,a’))Simple Reinforcement Learning with Tensorflow Part 0: Q-Learning with Tables and Neural Networks14.5K107Arthur JulianiMarcelo NovaesFollowJul 26, 2017 · 1 min readmissing a parenthesis here :P