Pure exploration and -exploitation, ϵ-greedy, Boltzmann exploration, optimistic initialization, confidence intervals, knowledge gradients — The exploration-exploitation dilemma is omnipresent in everyday life. Once you found a restaurant you like, you might decide to visit that very same restaurant for the rest of your life, exploiting your positive experience. However, there is a certain appeal in exploring new venues as well. Yes, you might get…