Approximate modified policy iteration. (2012)

by B Scherrer, M Ghavamzadeh, V Gabillon, M Geist
Venue: In Proceedings of the Twenty-Ninth International Conference on Machine Learning