Regularized policy iteration. In (2009)

by Szepesvari, Sh Mannor
Venue:Advances in Neural Information Processing Systems 21,