Error Propagation for Approximate Policy and Value Iteration. (2010)

by Amir-massoud Farahmand, Csaba Szepesvari, Remi Munos
Venue: In Advances in Neural Information Processing Systems