Least-squares policy iteration. (2003)

by M G Lagoudakis, R Parr
Venue:Journal of Machine Learning Research,