Learning to act using real-time dynamic programming (1995)

by A G Barto, S J Bradtke, S P Singh
Venue:Journal of AI