Modified policy iteration algorithms for discounted Markov decision problems (1978)

by Martin L Puterman, M C Shin
Venue:Management Science