Become a member
Sign in

Eq 1. Q(s,a) = r + γ(max(Q(s’,a’))

Simple Reinforcement Learning with Tensorflow Part 0: Q-Learning with Tables and Neural Networks
14.5K
107
Arthur Juliani
Marcelo Novaes
Marcelo Novaes
Jul 26, 2017 · 1 min read

missing a parenthesis here :P

    Marcelo Novaes

    Written by

    Marcelo Novaes

    Write the first response

    Discover Medium

    Welcome to a place where words matter. On Medium, smart voices and original ideas take center stage - with no ads in sight. Watch

    Make Medium yours

    Follow all the topics you care about, and we’ll deliver the best stories for you to your homepage and inbox. Explore

    Become a member

    Get unlimited access to the best stories on Medium — and support writers while you’re at it. Just $5/month. Upgrade
    AboutHelpLegal