Linear Least-Squares algorithms for temporal difference learning (1996)

by S J Bradtke, A G Barto
Venue:Machine Learning