On the Convergence of Policy Iteration (1979)

by M L Puterman, S Brumele
Venue:in Stationary Dynamic Programming” Mathematics of Operations Research