Saurav Sharma
Aug 23, 2017 · 1 min read

Hi Arthur,

How do I identify which actionNumber in env.step(actionNumber) belongs to which action type out of left, right,up,down. Also since the first value returned by env.step() is the new state, so if for a period of time say 10 or 20 iterations I get different state value for the same actionNumber. I am confused.

Amazing post for the beginners. Cheers!!

)
Welcome to a place where words matter. On Medium, smart voices and original ideas take center stage - with no ads in sight. Watch
Follow all the topics you care about, and we’ll deliver the best stories for you to your homepage and inbox. Explore
Get unlimited access to the best stories on Medium — and support writers while you’re at it. Just $5/month. Upgrade