Learning to Act using Real-Time Dynamic Programming (1995)

by Andrew G. Barto , Steven J. Bradtke , Satinder P. Singh
Venue:ARTIFICIAL INTELLIGENCE
Citations:472 - 17 self