#3: RLiable to Save RL Research, NeurIPS is Here, Offline RL Gains Steam, Check out COMARL
RLiable to Save RL Research
Deep RL is notorious for variability across training runs, and it turns out many benchmarks have long neglected that. In their recent paper "Deep RL at the Edge of the Statistical Precipice", Rishabh Agarwal and researchers from Google Brain and Mila discuss the subjectivity that creeps into RL comparisons when this variability goes unreported. To remedy this, their paper, an Outstanding Paper awardee at NeurIPS 2021, proposes more robust aggregate statistics for evaluating RL algorithms. To the same end, the researchers introduce RLiable, an open-source Python library.
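To give a flavor of the statistics the paper advocates, here is a minimal NumPy sketch (not the RLiable API itself) of two of them: the interquartile mean (IQM) over runs and tasks, and a stratified bootstrap confidence interval for it. The score shapes and parameter names are illustrative assumptions.

```python
import numpy as np

def iqm(scores):
    """Interquartile mean: the mean of the middle 50% of scores,
    more robust than the mean, less wasteful than the median."""
    flat = np.sort(np.asarray(scores).ravel())
    n = len(flat)
    return flat[n // 4 : n - n // 4].mean()

def stratified_bootstrap_ci(scores, reps=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for the IQM, resampling runs
    independently within each task (stratified resampling)."""
    rng = np.random.default_rng(seed)
    scores = np.asarray(scores)  # shape: (num_runs, num_tasks)
    num_runs, num_tasks = scores.shape
    stats = []
    for _ in range(reps):
        idx = rng.integers(num_runs, size=(num_runs, num_tasks))
        stats.append(iqm(np.take_along_axis(scores, idx, axis=0)))
    return np.quantile(stats, [alpha / 2, 1 - alpha / 2])

# Example: 10 runs x 5 tasks of (hypothetical) normalized scores
scores = np.random.default_rng(1).normal(1.0, 0.3, size=(10, 5))
point = iqm(scores)
low, high = stratified_bootstrap_ci(scores)
```

The library itself also provides performance profiles and probability-of-improvement estimates; the sketch above only illustrates the core idea of reporting an aggregate together with its uncertainty.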
NeurIPS is On and RL is Making a Splash
NeurIPS is progressing at full speed and there is a lot going on when it comes to RL. Here is what not to miss:
Outstanding Paper Awardees:
- Deep Reinforcement Learning at the Edge of the Statistical Precipice
- On the Expressivity of Markov Reward
Oral Sessions:
- An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap
- The best of both worlds: stochastic and adversarial episodic MDPs with unknown transition
- Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination
- Bellman-consistent Pessimism for Offline Reinforcement Learning
- Interesting Object, Curious Agent: Learning Task-Agnostic Exploration
- Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification
- Sequential Causal Imitation Learning with Unobserved Confounders
Workshops [registration required]:
- Deep Reinforcement Learning Workshop
- 2nd Offline Reinforcement Learning Workshop
- Political Economy of Reinforcement Learning (PERLS) Workshop
- Ecological Theory of Reinforcement Learning
Offline RL Gains Steam
Offline RL holds a lot of promise because it learns directly from datasets. One challenge, though, is to collect, record, annotate, and share data so that it can then be easily used in an offline RL setting. Google Brain takes on this challenge with their new Reinforcement Learning Datasets (RLDS) ecosystem, which offers conveniences at every step of the data pipeline. New to offline RL? Get started here.
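At its core, an RLDS-style dataset is a collection of episodes, each a sequence of steps carrying the observation, action, reward, and episode-boundary flags. The plain-Python sketch below (not the RLDS API, and with hypothetical helper names) illustrates that layout and how a consumer might iterate over it:

```python
def make_step(observation, action, reward,
              is_first=False, is_last=False, is_terminal=False):
    """One step record: observation, action, reward, plus flags marking
    episode boundaries and whether termination was environmental."""
    return {
        "observation": observation,
        "action": action,
        "reward": reward,
        "is_first": is_first,
        "is_last": is_last,
        "is_terminal": is_terminal,
    }

def episode_return(episode):
    """Undiscounted return: sum of rewards over the episode's steps."""
    return sum(step["reward"] for step in episode["steps"])

# A tiny three-step episode with per-episode metadata
episode = {
    "steps": [
        make_step([0.0], 1, 0.0, is_first=True),
        make_step([0.1], 0, 1.0),
        make_step([0.2], 1, 1.0, is_last=True, is_terminal=True),
    ],
    "metadata": {"agent_id": "demo"},
}
```

Separating `is_last` (the recorded trajectory ends here) from `is_terminal` (the environment actually terminated) is what lets offline RL code distinguish true terminal states from truncated episodes when bootstrapping values.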
COMARL: Challenges and Opportunities for Multi-Agent Reinforcement Learning
If you are into multi-agent RL, you will find this seminar series highly insightful. COMARL features talks about multi-agent RL from top researchers.
And More…
That is not all — several other great blog posts and papers have come out recently. Check them out.
Blogs:
- Permutation-Invariant Neural Networks for Reinforcement Learning
- DeepMind RL method promises better co-op between AI and humans
- Hugging Face’s Snowball Fight
Papers:
- A Survey of Generalisation in Deep Reinforcement Learning
- Physics-informed Dyna-style model-based deep reinforcement learning for dynamic control
- Adversarial Reinforcement Learning for Procedural Content Generation
- MoleGuLAR: Molecule Generation Using Reinforcement Learning with Alternating Rewards
RL Opportunities in Academia
- Vector Institute has many affiliated faculty working on RL and hiring graduate students for the upcoming academic year. Check out the work of Amir-massoud Farahmand, Angela Schoellig, Animesh Garg, Daniel M. Roy, Florian Shkurti, Jeff Clune, Joseph J. Williams, Pascal Poupart, Scott Sanner, Sheila McIlraith.
- The Cognitive Robot Autonomy and Learning (CoRAL) Lab is looking for PhD applicants for Fall 2022. Applications are through Purdue’s CS admissions portal.