A class of gradient-estimating algorithms for reinforcement learning in neural networks (1987)

by Ronald J Williams
Venue:In IEEE First International Conference on Neural Networks