Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding (1996)

by Richard S. Sutton
Venue:Advances in Neural Information Processing Systems 8
Citations:354 - 18 self

Documents Related by Co-Citation

1321 Learning from Delayed Rewards – C Watkins - 1989
3773 Reinforcement Learning I: Introduction – Richard S. Sutton, Andrew G. Barto - 1998
1227 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
287 On-Line Q-Learning Using Connectionist Systems – G. A. Rummery, M. Niranjan - 1994
187 Reinforcement Learning with Replacing Eligibility Traces – Satinder Singh, Richard S. Sutton - 1996
224 The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces – Andrew W. Moore, Christopher G. Atkeson - 1995
207 Stable Function Approximation in Dynamic Programming – Geoffrey J. Gordon - 1995
251 Generalization in Reinforcement Learning: Safely Approximating the Value Function – Justin A. Boyan, Andrew W. Moore - 1995
319 Policy Gradient Methods for Reinforcement Learning with Function Approximation – Richard S. Sutton, David Mcallester, Satinder Singh, Yishay Mansour - 1999
226 TD-gammon, a self-teaching backgammon program achieves masterlevel play – G Tesauro - 1994
217 An analysis of temporal-difference learning with function approximation – John N. Tsitsiklis, Benjamin Van Roy - 1997
207 Convergence of Stochastic Iterative Dynamic Programming Algorithms – Tommi Jaakkola, Michael I. Jordan, Satinder P. Singh - 1994
1303 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996
237 Residual Algorithms: Reinforcement Learning with Function Approximation – Leemon Baird - 1995
279 Improving Elevator Performance Using Reinforcement Learning – Robert Crites, Andrew Barto - 1996
314 Prioritized sweeping: Reinforcement learning with less data and less time – Andrew W. Moore, Christopher G. Atkeson - 1993
153 Asynchronous Stochastic Approximation and Q-Learning – John N. Tsitsiklis, Richard Sutton - 1994
153 Learning to predict by the methods of temporal di erences – R S Sutton - 1988
368 Practical Issues in Temporal Difference Learning – Gerald Tesauro - 1992