Published inAnalytics VidhyaFine-Tuning LLaMA 3.1 on Your Custom Dataset: A Comprehensive GuideThe recent surge in open weight models has opened up exciting opportunities for researchers and developers alike. LLaMA 3.1, one of the…Mar 10Mar 10
Published inAnalytics VidhyaHow to Write Good Quality Machine Learning CodeMachine learning has become an increasingly popular and powerful tool for solving a wide range of problems in areas such as image…Jan 5, 2023Jan 5, 2023
Published inAnalytics VidhyaRegularization — Understanding L1 and L2 regularization for Deep LearningUnderstanding what regularization is and why it is required for machine learning and diving deep to clarify the importance of L1 and L2…Nov 9, 20213Nov 9, 20213
Published inIntel Student AmbassadorsLive Graph Simulation using Python, Matplotlib and PandasMake live graphs with dynamic line, scatter and bar plots. Also learn to plot graphs in 3D and 2D quickly using pandas and csv.Apr 22, 20204Apr 22, 20204
Published inDataDrivenInvestorWhich Reinforcement learning-RL algorithm to use where, when and in what scenario?The what? why? when? and which? of Reinforcement learning algorithms and quick facts about existing reinforcement learning algorithms.Apr 14, 20202Apr 14, 20202
Published inAnalytics VidhyaDemystifying Deep Deterministic Policy Gradient(DDPG) using ChainerRL and OpenAI-baselinesAn in-depth explanation of DDPG, a popular Reinforcement learning technique and its breezy implementation using ChainerRL and Tensorflow.Nov 26, 2019Nov 26, 2019
Published inAnalytics VidhyaUnderstanding Proximal Policy Optimization (PPO) and it’s implementation on Mario Game EnvironmentWe shall learn the concept behind Proximal Policy Optimization (PPO) in the simple terms and then its implementation on a Mario…Aug 12, 20191Aug 12, 20191
Published inAnalytics VidhyaGame Of Thrones Episode script generation using LSTM and Recurrent cells in TensorflowGame of Thrones season 8 from LSTMS and AIAug 7, 2019Aug 7, 2019
Published inAnalytics VidhyaTF-Agents: A Flexible Reinforcement Learning Library for TensorFlowTF-Agents: A Flexible Reinforcement Learning Library for TensorFlow explained and tutorials given along with the code and linksAug 2, 2019Aug 2, 2019
Google football environment — Training using Asynchronous Advantage Actor-Critic (A3C) Part 2Co-Author: Vishal BidawatkaJul 5, 2019Jul 5, 2019