Learning to optimize via posterior sampling. (2014)

by D Russo, B Van Roy
Venue: Mathematics of Operations Research