Multi-armed bandit algorithms and empirical evaluation (2005)

by Joannès Vermorel, Mehryar Mohri
Venue:In European Conference on Machine Learning