Integrated architectures for learning, planning, and reacting based on approximating dynamic programming (1990)

by R Sutton
Venue:In Proceedings of International Conference on Machine Learning