Least-squares policy iteration (2003)

by M. Lagoudakis, R. Parr
Venue: Journal of Machine Learning Research