Reinforcement learning, from 0 to something in 60 days

Cédric Bellet
Biffures
Published in
2 min readFeb 26, 2018

In this series, I will attempt to learn something about reinforcement learning in a limited period of time, after work hours, with the intent to have a somewhat performant system built by the end of that period. A system that plays Atari games maybe, or a trading system — we will see.

There is no intended audience for this series, though if you wish to follow this, you should know that I know some fundamental of machine learnings but am no expert. I will try to learn as much as possible, with 50% planning, 50% improvisation as I will run into yet unknown unknowns. If you are a pro, you might find this boring and inaccurate, if you know nothing about maths and stats, I cannot guarantee you will like this either.

Below is my work plan, which will also be the summary of my findings, when I produce the associated articles. I will come back to this work plan and update it as we go. Today is Feb 28, 2018, and here is what I think we will explore, until end of April 2018:

--

--