Hi Matt,
Mahmoud Hossam
2
I used Inverse RL to obtain the reward function for different expert behaviors, and then using the obtained reward functions to train an RL agent. I built over Matt’s code, here is my implementation https://jangirrishabh.github.io/2016/07/09/virtual-car-IRL/
I think this might be useful for you to teach to your students, let me know if you like it :-D