Learning and sequential decision making (1989)

by AG Barto, RS Sutton, CJCH Watkins
Venue: Learning and Computational Neuroscience: Foundations of Adaptive Networks