DixitaniketAdvantage Actor-Critic (A2C) Algorithm Explained and Implemented in PyTorchUnderstanding the A2C AlgorithmJul 16
Renu KhandelwalUnlocking the Secrets of Actor-Critic Reinforcement Learning: A Beginner’s GuideUnderstanding Actor-Critic Mechanisms, Different Flavors of Actor-Critic Algorithms, and a Simple Implementation in PyTorchFeb 21, 20233
Gabriel CassimiroinGeek CultureA Deep Dive into the DDPG Algorithm for Continuous ControlFull project walkthrough with the implementation of the DDPG algorithm for the Continuous Control problem of the Reacher environment.Apr 14, 20232Apr 14, 20232
Masoud ShokrnezhadMastering Lunar Lander with Policy Gradient Algorithms: A Journey into Reinforcement LearningIntroduction: The Lunar Lander ChallengeJun 30Jun 30
Sthanikam SanthoshIntrinsic Curiosity — Reinforcement learningIntrinsic curiosity is a type of reinforcement learning in which the reward signal is generated internally by the agent rather than being…Jan 7, 2023Jan 7, 2023
DixitaniketAdvantage Actor-Critic (A2C) Algorithm Explained and Implemented in PyTorchUnderstanding the A2C AlgorithmJul 16
Renu KhandelwalUnlocking the Secrets of Actor-Critic Reinforcement Learning: A Beginner’s GuideUnderstanding Actor-Critic Mechanisms, Different Flavors of Actor-Critic Algorithms, and a Simple Implementation in PyTorchFeb 21, 20233
Gabriel CassimiroinGeek CultureA Deep Dive into the DDPG Algorithm for Continuous ControlFull project walkthrough with the implementation of the DDPG algorithm for the Continuous Control problem of the Reacher environment.Apr 14, 20232
Masoud ShokrnezhadMastering Lunar Lander with Policy Gradient Algorithms: A Journey into Reinforcement LearningIntroduction: The Lunar Lander ChallengeJun 30
Sthanikam SanthoshIntrinsic Curiosity — Reinforcement learningIntrinsic curiosity is a type of reinforcement learning in which the reward signal is generated internally by the agent rather than being…Jan 7, 2023
AndrewngaiReinforcement Learning Experiments in Different Arcade GamesKeywords: reinforcement learning; PPO2; A2C; Actor-Critic; Gym; arcade gamesMay 27
Sthanikam SanthoshReinforcement Learning(Part-7): Twin Delayed Deep Deterministic Policy Gradient(TD3) in Tensorflow2Many reinforcement learning (RL) algorithms have been proposed for solving various tasks in recent years. Among these, the twin delayed…Jun 12, 2022
Sanghavi harshNavigating the Evolution of Reinforcement Learning: A Historical PerspectiveTable of Contents:May 11