Ryan Pégoud – Medium

Ryan Pégoud

Ryan Pégoud
in
Towards Data Science

Breaking down State-of-the-Art PPO Implementations in JAX

All the tricks and details you wish you knew about Proximal Policy Optimization

9 min readMay 1, 2024

--

1

Breaking down State-of-the-Art PPO Implementations in JAX

--

1

Ryan Pégoud
in
Towards Data Science

A Gentle Introduction to Deep Reinforcement Learning in JAX

Solving the CartPole environment with DQN in under a second

10 min readNov 21, 2023

--

2

A Gentle Introduction to Deep Reinforcement Learning in JAX

--

2

Ryan Pégoud
in
Towards Data Science

Implementing a Transformer Encoder from Scratch with JAX and Haiku 🤖

Understanding the fundamental building blocks of Transformers.

12 min readNov 7, 2023

--

3

Implementing a Transformer Encoder from Scratch with JAX and Haiku 🤖

--

3

Ryan Pégoud
in
Towards Data Science

Vectorize and Parallelize RL Environments with JAX: Q-learning at the Speed of Light⚡

Learn to vectorize a GridWorld environment and train 30 Q-learning agents in parallel on a CPU, at 1.8 million step per seconds!

11 min readOct 15, 2023

--

2

Vectorize and Parallelize RL Environments with JAX: Q-learning at the Speed of Light⚡

--

2

Ryan Pégoud
in
Towards Data Science

Temporal-Difference Learning and the importance of exploration: An illustrated guide

A comparison of model-free (Q-learning) and model-based (Dyna-Q and Dyna-Q+) TD methods on a dynamic grid world.

15 min readSep 23, 2023

--

2

Temporal-Difference Learning and the importance of exploration: An illustrated guide

--

2

Ryan Pégoud

Ryan Pégoud

Reinforcement Learning enthusiast. Computational Statistics and Machine Learning MSc @UCL (2024-2025)

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams