Incremental natural actor-critic algorithms (2008)

by S Bhatnagar, R Sutton, M Ghavamzadeh, M Lee
Venue: Advances in Neural Information Processing Systems 20