A class of gradient-estimating algorithms for reinforcement learning in neural networks (1987)

by R J Williams
Venue:In Proceedings of the IEEE First International Conference on Neural Networks