Linear least-squares algorithms for temporal difference learning (1996)

by S J Bradtke, A G Barto
Venue: Machine Learning 22