See more
First, we must stop learning while interacting with the environment. We should try different things and play a little randomly to explore the state space. We can save t…