Sep 2, 2018 · 1 min read
I’ve disabled the replay, aka “learning”, and exploration, set it to 0. I don’t see any improvement after I enable learning or exploration. Is that normal? Basically, the network is not doing anything and by default it’s getting scores of 501 quite often. EDIT: It was loading previously saved weights -_-