The continuum-armed bandit problem (1926)

by R Agrawal
Venue:SIAM J. Control and Optimization