Learning to Act using Real-Time Dynamic Programming (1995)

by Andrew G. Barto , Steven J. Bradtke , Satinder P. Singh
Venue:ARTIFICIAL INTELLIGENCE
Citations:527 - 18 self