Ryan PégoudinTowards Data ScienceRainbow: The Colorful Evolution of Deep Q-Networks 🌈Everything you need to assemble the DQN Megazord in JAX.Jul 12Jul 12
Ryan PégoudinTowards Data ScienceA Practical Guide to Proximal Policy Optimization in JAXAll the tricks and details you wish you knew about PPOMay 11May 11
Ryan PégoudinTowards Data ScienceA Gentle Introduction to Deep Reinforcement Learning in JAXSolving the CartPole environment with DQN in under a secondNov 21, 20232Nov 21, 20232
Ryan PégoudinTowards Data ScienceImplementing a Transformer Encoder from Scratch with JAX and Haiku 🤖Understanding the fundamental building blocks of Transformers.Nov 7, 20233Nov 7, 20233
Ryan PégoudinTowards Data ScienceVectorize and Parallelize RL Environments with JAX: Q-learning at the Speed of Light⚡Learn to vectorize a GridWorld environment and train 30 Q-learning agents in parallel on a CPU, at 1.8 million step per seconds!Oct 15, 20232Oct 15, 20232
Ryan PégoudinTowards Data ScienceTemporal-Difference Learning and the importance of exploration: An illustrated guideA comparison of model-free (Q-learning) and model-based (Dyna-Q and Dyna-Q+) TD methods on a dynamic grid world.Sep 23, 20233Sep 23, 20233