Learning to act using real-time dynamic programming (1995)

by A Barto, S Bradtke, S Singh
Venue:Arti®cial Intelligence