Simple statistical gradient-following algorithms for connectionist reinforcement learning (1992)

by Ronald J. Williams
Venue: Machine Learning
Citations: 319 - 0 self