Ryan Pégoud – Medium

Ryan Pégoud

Ryan Pégoud
in
Towards Data Science

Breaking down State-of-the-Art PPO Implementations in JAX

All the tricks and details you wish you knew about Proximal Policy Optimization

May 1

Breaking down State-of-the-Art PPO Implementations in JAX

May 1

Ryan Pégoud
in
Towards Data Science

A Gentle Introduction to Deep Reinforcement Learning in JAX

Solving the CartPole environment with DQN in under a second

Nov 21, 2023

A Gentle Introduction to Deep Reinforcement Learning in JAX

Nov 21, 2023

Ryan Pégoud
in
Towards Data Science

Implementing a Transformer Encoder from Scratch with JAX and Haiku 🤖

Understanding the fundamental building blocks of Transformers.

Nov 7, 2023

Implementing a Transformer Encoder from Scratch with JAX and Haiku 🤖

Nov 7, 2023

Ryan Pégoud
in
Towards Data Science

Vectorize and Parallelize RL Environments with JAX: Q-learning at the Speed of Light⚡

Learn to vectorize a GridWorld environment and train 30 Q-learning agents in parallel on a CPU, at 1.8 million step per seconds!

Oct 15, 2023

Vectorize and Parallelize RL Environments with JAX: Q-learning at the Speed of Light⚡

Oct 15, 2023

Ryan Pégoud
in
Towards Data Science

Temporal-Difference Learning and the importance of exploration: An illustrated guide

A comparison of model-free (Q-learning) and model-based (Dyna-Q and Dyna-Q+) TD methods on a dynamic grid world.

Sep 23, 2023

Temporal-Difference Learning and the importance of exploration: An illustrated guide

Sep 23, 2023

Ryan Pégoud

Ryan Pégoud

Reinforcement Learning enthusiast. Computational Statistics and Machine Learning MSc @UCL (2024-2025)

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams