Kaige

Not Another Entry-Level Tutorial of Reinforcement Learning. This is a series of posts on the 'advanced bits' of practical reinforcement learning. Topics will be covered: … (May 23)
PyTorch Version of Life-Time Value Prediction. A Deep Probabilistic Model for Customer Lifetime Value Prediction. (Apr 27)
Paper Reading: Imitation Learning with Concurrent Actions in 3D Games. This paper is from the SEED team at Electronic Arts. In this article, we collect quotations from the paper as a summary. (Apr 18)
Paper Reading: Proto-Value Networks. Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks. (Apr 13)
How to learn successor representations from data? This article summarizes methods from the literature on how to learn successor representations from data. (Apr 1)
Paper Reading: Generalizing Successor Features to Continuous Domains for Multi-Task Learning. As the title suggests, this paper demonstrates how to use learned successor features in continuous multi-task learning. I put some key… (Mar 27)
Paper Reading: Learning Successor States and Goal-Dependent Values: A Mathematical Viewpoint. This paper is a deep analysis of the successor representation (SR). I put some key insights and points from this paper. (Mar 26)
Not Another Intrinsic Reward. The following papers make some interesting arguments about intrinsic rewards for exploration. I put them here for reference. (Mar 22)
Paper Reading: What About Inputting the Policy into the Value Function? In an actor-critic RL agent, the value function predicts the state value under the current policy and is then used to guide the policy… (Mar 22)