Learning to Act using Real-Time Dynamic Programming (1995)

by Andrew G. Barto , Steven J. Bradtke , Satinder P. Singh
Venue:ARTIFICIAL INTELLIGENCE
Citations:637 - 20 self