Homepage
Sign in / Sign up
Go to the profile of Rishabh Jangir
Rishabh Jangir
Jul 28, 2016
Hi Matt,
Mahmoud Hossam
2

I used Inverse RL to obtain the reward function for different expert behaviors, and then using the obtained reward functions to train an RL agent. I built over Matt’s code, here is my implementation https://jangirrishabh.github.io/2016/07/09/virtual-car-IRL/
I think this might be useful for you to teach to your students, let me know if you like it :-D

  • Go to the profile of Rishabh Jangir

    Rishabh Jangir

    • Share
    Go to the profile of Rishabh Jangir
    Never miss a story from Rishabh Jangir, when you sign up for Medium. Learn more
    Never miss a story from Rishabh Jangir