A unified analysis of value-function-based reinforcement-learning algorithms. Neural Computation (1997)

by Csaba Szepesvari, Michael L. Littman
Citations: 32 (7 self)