Gradient Descent for General Reinforcement Learning (1999)

by L Baird, A Moore
Venue: Adv. Neural Inf. Process. Syst.