Policy search by dynamic programming (2003)

by J. Andrew Bagnell , Andrew Y. Ng , Sham Kakade , Jeff Schneider
Venue:in Advances in Neural Information Processing Systems
Citations:38 - 2 self