List: RL | Curated by Mark | Medium

Oct 17, 2023

4 stories

RL

Wouter van Heeswijk, PhD
in
Towards Data Science

Natural Policy Gradients In Reinforcement Learning Explained

Traditional policy gradient methods are inherently flawed. Natural gradients converge quicker and better, forming the foundation of…

Sep 2, 2022

Natural Policy Gradients In Reinforcement Learning Explained

Sep 2, 2022

Nicolo Cosimo Albanese
in
Towards Data Science

Dynamic Pricing with Reinforcement Learning from Scratch: Q-Learning

An introduction to Q-Learning with a practical Python example

Aug 26, 2023

Dynamic Pricing with Reinforcement Learning from Scratch: Q-Learning

Aug 26, 2023

Subash Palvel

Introduction to Transfer Learning in Reinforcement Learning

Transfer learning is a powerful technique that allows us to leverage knowledge gained from one task to improve performance on another…

Sep 18, 2023

Sep 18, 2023

Wouter van Heeswijk, PhD
in
Towards Data Science

Trust Region Policy Optimization (TRPO) Explained

The Reinforcement Learning algorithm TRPO builds upon natural policy gradient algorithms, ensuring updates remain within ‘trustworthy’…

Oct 12, 2022

Trust Region Policy Optimization (TRPO) Explained

Oct 12, 2022

Mark

Mark

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams