Least-squares policy iteration (2003)

by M G Lagoudakis, R Parr
Venue: Journal of Machine Learning Research