Online optimization in X-armed bandits (2008)

by Sébastien Bubeck, Rémi Munos, Gilles Stoltz, Csaba Szepesvári
Venue:In Advances in Neural Information Processing Systems 22