Finite-time Regret Bounds for the Multiarmed Bandit Problem (1998)
by
Nicolò Cesa-Bianchi
,
Paul Fischer
| Venue: | In 5th International Conference on Machine Learning |
| Citations: | 6 - 0 self |







