Javier Abellán AbenzainNeurosapiens9. Oject detectionfast.ai DL2 Lesson 9: Single Shot Detection detailed walkthroughJan 13, 2019Jan 13, 2019
Javier Abellán AbenzainNeurosapiens11. Multi Agent RLMonte Carlo Tree Search (MCTS)Jan 5, 2019Jan 5, 2019
Javier Abellán AbenzainNeurosapiens10. Actor Critic MethodsDeep Deterministic Policy Gradients (DDPG)Jan 5, 2019Jan 5, 2019
Javier Abellán AbenzainNeurosapiens9. Policy Gradient MethodsGeneralized Advantage Estimation (GAE), Trust Region Policy Optimization (TRPO) and Proximal Policy Optimization (PPO).Jan 5, 2019Jan 5, 2019
Javier Abellán AbenzainNeurosapiens8. Policy Based MethodsEvolutionary algorithms, stochastic policy search, and REINFORCE algorithm.Jan 5, 2019Jan 5, 2019
Javier Abellán AbenzainNeurosapiens7. Value Based MethodsDeep Q-Network (DQN), along with Double-DQN, Dueling-DQN, and Prioritized Replay.Jan 5, 2019Jan 5, 2019
Javier Abellán AbenzainNeurosapiens5. RL in Continuous SpacesLearn how to adapt traditional algorithms to work with continuous spaces. Discretization. Tile CodingJan 5, 2019Jan 5, 2019
Javier Abellán AbenzainNeurosapiens4. Temporal-Difference LearningLearn the difference between the Sarsa, Q-Learning, and Expected Sarsa algorithms.Jan 5, 2019Jan 5, 2019