Wah Loon Keng – Medium

Wah Loon Keng

Wah Loon Keng

Soft Actor-Critic for continuous and discrete actions

With the Atari benchmark complete for all the core RL algorithms in SLM Lab, I finally had time to implement a new algorithm, Soft…

Aug 11, 2019

Soft Actor-Critic for continuous and discrete actions

Aug 11, 2019

Wah Loon Keng

Multi-inheritance magic in SLM Lab

Deep RL is known to be complex in terms of engineering, because it involves so many things. One goal of SLM Lab is to reduce this…

Sep 26, 2018

Sep 26, 2018

Wah Loon Keng

pip module for RL agents in SLM Lab

As part of the usability improvement in SLM Lab v2.0.0, we have also made it possible to just use the agents outside of the SLM framework…

Sep 5, 2018

Sep 5, 2018

Wah Loon Keng

Deep Reinforcement Learning with SLM Lab

SLM-Lab v2.0.0 was recently released. This version represents a major milestone, as it has grown to over a dozen algorithms while retaining…

Sep 5, 2018

Deep Reinforcement Learning with SLM Lab

Sep 5, 2018

Wah Loon Keng

OpenAI Five DotA: Solving the Global Skill Problem

Laura and I were talking about how we would solve the global skill problem in OpenAI’s DotA Five, which is still evidently lacking from…

Aug 13, 2018

Aug 13, 2018

Wah Loon Keng

Semantic Correspondence Part 2

Read part 1 here:

Aug 10, 2018

Aug 10, 2018

Wah Loon Keng

Semantic correspondence via PowerNet expansion

start with cartpole network, 4 inputs, 2 outputs, train network

Aug 9, 2018

Aug 9, 2018

Wah Loon Keng

OpenAI Five DotA: Next Challenges

If you don’t already know, OpenAI’s Dota Five had just played against some pro players during the benchmark event. It was streamed on…

Aug 7, 2018

OpenAI Five DotA: Next Challenges

Aug 7, 2018

Wah Loon Keng

Falsifiability and General Turing Test

Another continued note to my coauthor.

Jul 24, 2018

Jul 24, 2018

Wah Loon Keng

Stages of Semantics Grounding

This is a correspondence to my coauthor.

Jul 19, 2018

Jul 19, 2018

Wah Loon Keng

Wah Loon Keng

Deep Reinforcement Learning. Semantics. Rock Climbing. https://github.com/kengz

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams