(More) efficient reinforcement learning via posterior sampling. (2013)

by Ian Osband, Dan Russo, Benjamin Van Roy
Venue:In Advances in Neural Information Processing Systems,