Direct gradient-based reinforcement learning (1999)

by J Baxter, P Bartlett
Venue:Journal of Artificial Inteligence Reseach