Integrated architecture for learning, planning, and reacting based on approximating dynamic programming (1990)

by Richard S Sutton
Venue:In Proceedings of the seventh international conference (1990) on Machine learning