Practical issues in temporal difference learning (1992)

by G Tesauro
Venue: Machine Learning 8:257–277