Learning to act using real-time dynamic programming (1995)

by Andrew G. Barto , Steven J. Bradtke , Satinder P. Singh , The Thank Rich Yee , Vijay Gullapalli , Brian Pinette
Venue:Artificial Intelligence
Citations:526 - 18 self