Finite-sample analysis of least-squares policy iteration. (2012)

by A Lazaric, M Ghavamzadeh, R Munos
Venue:Journal of Machine Learning Research,