Kaige – Medium

Kaige

Kaige

Ray RLlib: PPO+Action-Mask+Customized Models

This article shows how to integrate action mask and customized models to PPO in Ray RLlib. It contains the following steps:

1d ago

1d ago

Kaige

Ray RLlib: Action-Mask+DQN

This article, how to add discrete action mask into Ray RLlib DQN algorithm. We use Pytorch framework. It contains the following steps

3d ago

3d ago

Kaige

Not Another Entry-Level Tutorial of Reinforcement Learning

This is a series of posts on ‘advanced bits’ of practical reinforcement learning. Topics will be covered:

May 23

May 23

Kaige

PyTorch Version of Life-Time Value Prediction

A DEEP PROBABILISTIC MODEL FOR CUSTOMER LIFETIME VALUE PREDICTION

Apr 27

Apr 27

Kaige

Paper Reading: Imitation Learning with Concurrent Actions in 3D Games

This paper is from SEED Team Electronic Arts. In this article, we put quotations from this paper to be a summary

Apr 18

Paper Reading: Imitation Learning with Concurrent Actions in 3D Games

Apr 18

Kaige

Paper Reading: Deep Successor Representation

Deep Successor Representation

Apr 14

Paper Reading: Deep Successor Representation

Apr 14

Kaige

Paper Reading: Proto-Value-Networks

Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks

Apr 13

Paper Reading: Proto-Value-Networks

Apr 13

Kaige

How to learn successor representations from data?

This article summaries the methods in the literature o how to learn successor representations from data

Apr 1

How to learn successor representations from data?

Apr 1

Kaige

Paper Reading: Generalization successor features to continuous domains for multi-task learning

As the title shows, this paper demonstrate how to use the learned successor feature in continuous multi-task learning. I put some key…

Mar 27

Paper Reading: Generalization successor features to continuous domains for multi-task learning

Mar 27

Kaige

Paper Reading: Learning Successor States and Goal-Dependent Values: A mathematical Viewpoint

This paper is a deep analysis on successor representation (SR). I put some key insights and points from this paper.

Mar 26

Mar 26

Kaige

Kaige

Applied Scientist | PhD | UCL

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams