Markov Decision Processes: Discrete Dynamic Stochastic Programming (1994)

by M L Puterman