Integrated architectures for learning, planning, and reacting based on approximating dynamic programming (1990)

by R S Sutton
Venue:Proceedings of the Seventh International Conference on Machine Learning (ML-90