Published inTDS ArchiveUnderstanding and Implementing Distributed Prioritized Experience Replay (Horgan et al., 2018)Accelerating Deep Reinforcement Learning with Distributed ArchitecturesNov 25, 2019Nov 25, 2019
Published inTDS ArchiveIn-depth review of Soft Actor-CriticUnderstanding State-of-the-Art Reinforcement Learning AlgorithmsNov 24, 2019A response icon3Nov 24, 2019A response icon3
Published inTDS ArchiveDueling Deep Q NetworksDueling Network Architectures for Deep Reinforcement LearningOct 19, 2019A response icon3Oct 19, 2019A response icon3
Published inTDS ArchiveDouble Deep Q NetworksTackling maximization bias in Deep Q-learningJul 17, 2019A response icon5Jul 17, 2019A response icon5
Published inTDS ArchiveVanilla Deep Q NetworksDeep Q Learning ExplainedJul 15, 2019A response icon2Jul 15, 2019A response icon2
Published inTDS ArchiveDeep Deterministic Policy Gradients ExplainedReinforcement Learning in Continuous Action SpacesMar 20, 2019A response icon10Mar 20, 2019A response icon10
Published inTDS ArchiveCan Artificial Intelligence Help Medical Decision Making?A Reinforcement Learning based Intelligent Physician for Chemotherapy TreatmentMar 19, 2019Mar 19, 2019
Published inTDS ArchiveUnderstanding Actor Critic MethodsPreliminariesFeb 6, 2019A response icon41Feb 6, 2019A response icon41
Deriving Policy Gradients and Implementing REINFORCEPolicy gradient methods are ubiquitous in model free reinforcement learning algorithms — they appear frequently in reinforcement learning…Dec 30, 2018A response icon21Dec 30, 2018A response icon21