Error Propagation for Approximate Policy and Value Iteration

by Amir Massoud Farahmand, Rémi Munos, Csaba Szepesvári