On the convergence of stochastic iterative dynamic programming algorithms (1994)

by T Jaakkola, M I Jordan, S P Singh
Venue:Neural Comput