Convergent Combinations of Reinforcement Learning with Linear Function Approximation (2002)

by Ralf Schoknecht, Artur Merke
Venue:In Proceedings of the 15th Neural Information Processing Systems conference