Stanford AI for Human Impact

Policy Certificates and Minimax-Optimal PAC Bounds for Episodic Reinforcement Learning

Designing reinforcement learning methods which find a good policy with as few samples as possible is a key goal of both empirical and…

Aug 16, 2019

Towards Reinforcement Learning Inspired By Humans Without Human Demonstrations

Strategic Object Oriented Reinforcement Learning (SOORL)

Ramtin Keramati

May 31, 2018
