Analytics Vidhya
Published in

Analytics Vidhya

My Reinforcement Learning Adventures

Image of one of my experiment runs

Version 1

Version 1 environment test

Environment

  • Action space
  • Observation space
  • Basic model explanation

Results

  • The main problem is that there doesn’t seem to be much coordination between players
  • The environment is too simple to see military tactics. Like moving diagonally left down is not faster than moving left then down.
  • Also, the policy and the value stopped learning after a bit. At the time I made this version, I assumed that the environment was too simple but now I think it is because the observation space was encoded in such an overcomplicated way.

Version 2

Version 2 environment test

Environment

  • Action space
  • Observation space
  • Basic model explanation

Results

  • Players can move randomly without coordination
  • The wings attack mechanism is too complicated
  • Like before, the observation space is too complicated
  • And finally, this is applicable to version 1 too but the action space is a bit too complicated to be human-understandable.

Version 3

Version 3 environment test. The images next to each other are what each model sees. So, the top left video is what the blue side sees and the one next to it is what the red guys see.

Environment

  • Action space
  • Observation space
  • QOL changes
Spring pulling visualization
  • Basic model explanation

Results

Policy loss

Version 4

Version 4 environment test run. As you can see, it isn’t going well.
  • Action space
  • Observation space
  • QOL changes

Results

Reward plot with different models
Policy loss plot with different models

Version 5(Current)

Version 5 environment test run
  • Action space
  • Observation space

Results

Reward plot with different models
Policy loss plot with different models

Next Steps

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store