Policy Gradients in Reinforcement Learning, Homework 2 — CS294
This are our notes to the Homework 2 for the course CS294–112 Berkley — Deep Reinforcement Learning. This document is organised as follows:
- Lecture Review: review of the theoretical concepts, here…