Gradient descent for general reinforcement learning (1999)

by L C Baird, A W Moore
Venue:In Advances in Neural Information Processing Systems11