Reinforcement learning, from 0 to something in 60 days

Published in

Biffures

2 min readFeb 26, 2018

In this series, I will attempt to learn something about reinforcement learning in a limited period of time, after work hours, with the intent to have a somewhat performant system built by the end of that period. A system that plays Atari games maybe, or a trading system — we will see.

There is no intended audience for this series, though if you wish to follow this, you should know that I know some fundamental of machine learnings but am no expert. I will try to learn as much as possible, with 50% planning, 50% improvisation as I will run into yet unknown unknowns. If you are a pro, you might find this boring and inaccurate, if you know nothing about maths and stats, I cannot guarantee you will like this either.

Below is my work plan, which will also be the summary of my findings, when I produce the associated articles. I will come back to this work plan and update it as we go. Today is Feb 28, 2018, and here is what I think we will explore, until end of April 2018:

A verification that even single neurons can do most of the things I thought was pretty advanced and cool back in university
A reflection on why neural networks don’t actually use those amazing single neurons and instead use simpler versions of them
A super fast rundown of Tensforflow basics, and maybe a couple of examples of neural nets created with Tensorflow
Reinforcement learning fundamentals (part 1, part 2)
Applied reinforcement learning with Tensorforce, and a working system

Reinforcement learning, from 0 to something in 60 days

Written by Cédric Bellet