Least-squares temporal difference learning. (1999)

by J A Boyan
Venue:In Proc. of ICML 16,