Real-time learning and control using asynchronous dynamic programming (Technical Report 91-57 (1991)

by A G Bmdtke Barto, S J, S P Singh