Temporal Difference Learning and TDGammon (1995)

by G Tesauro
Venue:Communications of the ACM