Carsten Friedrich

- Part 8 - Tic Tac Toe with Policy Gradient Descent. Do you really need values if you have good policies? (Jul 20, 2018)
- Part 7 - This is deep. In a convoluted way. One trick that we haven't tried yet is the use of Convolutional Neural Network (CNN) layers. Time to do so now. (Jun 6, 2018)
- Part 6 - Double Duelling Q Network with Experience Replay. In the previous part we discovered that just being a bit less greedy in our action policy was not enough. (Jun 6, 2018)
- Part 5 - Q Network review and becoming less greedy. Looking at what went wrong and becoming less greedy. (Jun 6, 2018)
- Part 4 - Neural Network Q Learning, a Tic Tac Toe player that learns - kind of. Training a Neural Network to learn the Tic Tac Toe Q function. (Jun 6, 2018)
- Part 3 - Tabular Q Learning, a Tic Tac Toe player that gets better and better. In this part, we will introduce our first player which actually uses a machine learning approach to playing Tic Tac Toe. (Jun 6, 2018)
- Part 2 - The Min Max Algorithm. In this Notebook, we will introduce and then use the Min-Max algorithm to create a computer player which will be able to play Tic Tac Toe. (Jun 6, 2018)
- Part 1 - Computer Tic Tac Toe Basics. Basic Tic Tac Toe support classes and game logic. (Jun 6, 2018)
- Teaching a computer to play Tic Tac Toe. From classic algorithms to Reinforcement learning with Neural Networks. (Jun 6, 2018)