Simple statistical gradient-following algorithms for connectionist reinforcement learning (1992)

by Ronald J. Williams
Venue:Machine Learning
Citations:262 - 0 self