Finite-time analysis of the multiarmed bandit problem (2002)

by P Auer, N Cesa-Bianchi, P Fischer
Venue: Machine Learning