Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms (1998)
by Satinder Singh, Tommi Jaakkola, Michael L. Littman, Csaba Szepesvári
| Venue: | MACHINE LEARNING |
| Citations: | 91 - 5 self |







