Finite-time analysis of the multi-armed bandit problem. (2002)

by P Auer, N Cesa-Bianchi, P Fischer
Venue:Machine Learning,