Zac Wellmer – Medium

Zac Wellmer
in
Arxiv Bytes

Summary: GameGAN

Ideas from this summary are taken from the GameGAN paper.

Jun 23, 2020

Zac Wellmer
in
Arxiv Bytes

Summary: Conservative Policy Iteration

Conservative Policy Iteration has 3 goals: (1) an iterative procedure guaranteed to improve a performance metric, (2) terminate in a…

May 7, 2019

Zac Wellmer
in
Arxiv Bytes

Summary: SimPLe

Ideas and figures in this summary are taken from Model-Based Reinforcement Learning for Atari (SimPLe).

Apr 25, 2019

Zac Wellmer
in
Arxiv Bytes

Summary: PlaNet

Deep Planning Network (PlaNet) is a model-based agent that learns a latent state dynamics model from images and takes actions based on…

Feb 25, 2019

Zac Wellmer
in
Arxiv Bytes

Summary: World Models

One of the core issues in Reinforcement Learning is sample complexity. Therefore, it’s appealing to train RL agents in a simulator which…

Feb 16, 2019

Zac Wellmer
in
Arxiv Bytes

Summary: Learning Plannable Representations with Causal InfoGAN

The goal of this work is to plan a sequence of abstract states toward a goal and then decode the abstract states to their…

Feb 1, 2019

Zac Wellmer
in
Arxiv Bytes

Summary: Value Prediction Networks(VPN)

VPN is a deep reinforcement learning architecture that mixes ideas from both model-free and model-based methods. Generally, model-based…

Sep 15, 2018

Zac Wellmer
in
Arxiv Bytes

Summary: Proximal Policy Optimization(PPO)

Ideas from this summary are taken from the Proximal Policy Optimization paper.

Sep 14, 2018

Zac Wellmer
in
Arxiv Bytes

Summary: TreeQN

Ideas from this summary are taken from the TreeQN and ATreeC paper.

Sep 13, 2018

Zac Wellmer
in
Arxiv Bytes

Summary: Deep Deterministic Policy Gradients

This post is a summary of Continuous Control With Deep Reinforcement Learning.

Nov 10, 2017
