Ryan PégoudinTowards Data ScienceBreaking down State-of-the-Art PPO Implementations in JAXAll the tricks and details you wish you knew about Proximal Policy OptimizationMay 11May 11
Ryan PégoudinTowards Data ScienceA Gentle Introduction to Deep Reinforcement Learning in JAXSolving the CartPole environment with DQN in under a secondNov 21, 20232Nov 21, 20232
Ryan PégoudinTowards Data ScienceImplementing a Transformer Encoder from Scratch with JAX and Haiku 🤖Understanding the fundamental building blocks of Transformers.Nov 7, 20233Nov 7, 20233
Ryan PégoudinTowards Data ScienceVectorize and Parallelize RL Environments with JAX: Q-learning at the Speed of Light⚡Learn to vectorize a GridWorld environment and train 30 Q-learning agents in parallel on a CPU, at 1.8 million step per seconds!Oct 15, 20232Oct 15, 20232
Ryan PégoudinTowards Data ScienceTemporal-Difference Learning and the importance of exploration: An illustrated guideA comparison of model-free (Q-learning) and model-based (Dyna-Q and Dyna-Q+) TD methods on a dynamic grid world.Sep 23, 20232Sep 23, 20232