Realtime learning and control using asynchronous dynamic programmming (1991)

by A G Baro, S J Bradtke, S P Singh
Venue:University of Massachusetts at Amherst