Real-time Learning and Control Using Asynchronous Dynamic Programming (1995)

by A G Barto, S J Bradtke, S P Singh
Venue: Artificial Intelligence