Amresh Venugopal in Skit, Nov 26, 2018: Implementing Deep Deterministic Policy Gradient with Unity’s ML-Agents Environment
Amresh Venugopal in Skit, Oct 28, 2018: Navigating through Unity’s ML Agent’s environment using DQN. Unity recently released v0.5 of its ML-Agents toolkit, and I am really excited to use it for learning and research.
Amresh Venugopal in Skit, Oct 2, 2018: Attempting Open-ai’s taxi-v2 using the SARSA-max algorithm. OpenAI Gym, as its documentation says, is “…a toolkit for developing and comparing Reinforcement learning algorithms”. I selected the…
Amresh Venugopal in Skit, Sep 22, 2018: Seeing the world in three dimensions. A chemical explosion left a child blind at the age of three; this child became a successful businessman and a championship paralympic…
Amresh Venugopal in Skit, Sep 9, 2018: Reinforcement Learning: Train a bot to play tic-tac-toe. Reinforcement learning is learning what to do, how to map situations to actions, so as to maximize a reward.