Florian Hoidn. Off-Policy Q-learning in OpenAI Universe: Part 2 — Train Your Own Reward Function. Dec 10, 2017.
Florian Hoidn. Off-Policy Q-learning in OpenAI Universe: Part 1 — Setting up OpenAI’s Baseline DQN. Jul 2, 2017.