Self-improving reactive agents based on reinforcement learning, planning and teaching (1992)

by Long-ji Lin
Venue:Machine Learning
Citations:275 - 2 self

Documents Related by Co-Citation

1226 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
1309 Learning from Delayed Rewards – C Watkins - 1989
363 Practical Issues in Temporal Difference Learning – Gerald Tesauro - 1992
251 Generalization in Reinforcement Learning: Safely Approximating the Value Function – Justin A. Boyan, Andrew W. Moore - 1995
334 Automatic Programming of Behavior-based Robots using Reinforcement Learning – S. Mahadevan, J. Connell, C. Sammut, R. Sutton, Temporal Phd - 1991
153 Learning to predict by the methods of temporal diā†µerences – R S Sutton - 1988
2593 On the theory of dynamic programming – Richard E Bellman - 1952
175 Neuron-like elements that can solve difficult learning control problems – A Barto, R Sutton, C Anderson - 1983
133 Input generalization in delayed reinforcement learning: An algorithm and performance comparisons – David Chapman, Leslie Pack Kaelbling - 1991
242 Temporal credit assignment in reinforcement learning. Doctoral dissertation – R S Sutton - 1984
3760 Reinforcement Learning I: Introduction – Richard S. Sutton, Andrew G. Barto - 1998
285 On-Line Q-Learning Using Connectionist Systems – G. A. Rummery, M. Niranjan - 1994
355 Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding – Richard S. Sutton - 1996
69 Strategy Learning with Multilayer Connectionist Representations – Charles W. Anderson - 1987
61 Issues in Using Function Approximation for Reinforcement Learning – Sebastian Thrun , Anton Schwartz - 1993
88 Incremental Multi-Step Q-Learning – Jing Peng, Ronald J. Williams - 1996
186 Reinforcement Learning with Replacing Eligibility Traces – Satinder Singh, Richard S. Sutton - 1996
426 Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning – Richard Sutton, Doina Precup, Satinder Singh - 1999
1298 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996