Stanford AI for Human Impact
Policy Certificates and Minimax-Optimal PAC Bounds for Episodic Reinforcement Learning
Designing reinforcement learning methods which find a good policy with as few samples as possible is a key goal of both empirical and…
Christoph Dann
Aug 16, 2019
Towards Reinforcement Learning Inspired By Humans Without Human Demonstrations
Strategic Object Oriented Reinforcement Learning (SOORL)
Ramtin Keramati
May 31, 2018