Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding (1996)

by Richard S. Sutton
Venue:Advances in Neural Information Processing Systems 8
Citations:355 - 18 self

Documents Related by Co-Citation

1309 Learning from Delayed Rewards – C Watkins - 1989
3760 Reinforcement Learning I: Introduction – Richard S. Sutton, Andrew G. Barto - 1998
1226 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
285 On-Line Q-Learning Using Connectionist Systems – G. A. Rummery, M. Niranjan - 1994
186 Reinforcement Learning with Replacing Eligibility Traces – Satinder Singh, Richard S. Sutton - 1996
224 The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces – Andrew W. Moore, Christopher G. Atkeson - 1995
208 Stable Function Approximation in Dynamic Programming – Geoffrey J. Gordon - 1995
251 Generalization in Reinforcement Learning: Safely Approximating the Value Function – Justin A. Boyan, Andrew W. Moore - 1995
319 Policy Gradient Methods for Reinforcement Learning with Function Approximation – Richard S. Sutton, David Mcallester, Satinder Singh, Yishay Mansour - 1999
224 TD-gammon, a self-teaching backgammon program, achieves master-level play – G J Tesauro - 1994
218 An analysis of temporal-difference learning with function approximation – John N. Tsitsiklis, Benjamin Van Roy - 1997
207 Convergence of Stochastic Iterative Dynamic Programming Algorithms – Tommi Jaakkola, Michael I. Jordan, Satinder P. Singh - 1994
1298 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996
237 Residual Algorithms: Reinforcement Learning with Function Approximation – Leemon Baird - 1995
278 Improving Elevator Performance Using Reinforcement Learning – Robert Crites, Andrew Barto - 1996
316 Prioritized sweeping: Reinforcement learning with less data and less time – Andrew W. Moore, Christopher G. Atkeson - 1993
151 Asynchronous Stochastic Approximation and Q-Learning – John N. Tsitsiklis, Richard Sutton - 1994
153 Learning to predict by the methods of temporal diā†µerences – R S Sutton - 1988
363 Practical Issues in Temporal Difference Learning – Gerald Tesauro - 1992