Convergence of Stochastic Iterative Dynamic Programming Algorithms (1994)

by Tommi Jaakkola, Michael I. Jordan, Satinder P. Singh
Venue:Neural Computation