Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms (1998)

by Satinder Singh , Tommi Jaakkola , Michael L. Littman , Csaba Szepesvári
Venue:MACHINE LEARNING
Citations:151 - 7 self