Learning to Act using Real-Time Dynamic Programming (1993)

by Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh, The Thank Rich Yee, Vijay Gullapalli, Brian Pinette