Gradient Descent for General Reinforcement Learning (1998)

by Leemon Baird , Andrew Moore
Venue:In Advances in Neural Information Processing Systems 11
Citations:126 - 0 self