Efficient exploration and value function generalization in deterministic systems (2013)

by Zheng Wen, Benjamin Van Roy
Venue:In Advances in Neural Information Processing Systems 26