Multi-armed bandit algorithms and empirical evaluation (2005)

by J Vermorel, M Mohri
Venue: In European Conf. on Machine Learning