Motivated Reinforcement Learning (2001)

by Peter Dayan
Citations: 252 (9 self)

Documents Related by Co-Citation

187 Reinforcement learning for robots using neural networks – L-J Lin - 1992
160 Transfer of Learning by Composing Solutions of Elemental Sequential Tasks – Satinder Pal Singh - 1992
527 Learning to act using real-time dynamic programming – Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh - 1993
99 Hierarchical Learning in Stochastic Domains: Preliminary Results – Leslie Pack Kaelbling - 1993
1309 Learning from Delayed Rewards – C Watkins - 1989
207 Convergence of Stochastic Iterative Dynamic Programming Algorithms – Tommi Jaakkola, Michael I. Jordan, Satinder P. Singh - 1994
473 Integrated architectures for learning, planning, and reacting based on approximating dynamic programming – Richard S. Sutton - 1990
2593 On the theory of dynamic programming – Richard E Bellman - 1952
109 Decomposition Techniques for Planning in Stochastic Domains – Thomas Dean, Shieu-hong Lin - 1995
316 Prioritized sweeping: Reinforcement learning with less data and less time – Andrew W. Moore, Christopher G. Atkeson - 1993
1226 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
513 Dynamic Programming and Markov Processes – R A Howard - 1960
278 Improving Elevator Performance Using Reinforcement Learning – Robert Crites, Andrew Barto - 1996
363 Practical Issues in Temporal Difference Learning – Gerald Tesauro - 1992
240 Reinforcement learning with hierarchies of machines – Ronald Parr, Stuart Russell - 1998
92 Efficient Learning and Planning Within the Dyna Framework – Jing Peng, Ronald J. Williams - 1993
3760 Reinforcement Learning: An Introduction – Richard S. Sutton, Andrew G. Barto - 1998
122 The MAXQ Method for Hierarchical Reinforcement Learning – Thomas G. Dietterich - 1998
114 Reinforcement Learning Methods for Continuous-Time Markov Decision Problems – Steven J. Bradtke, Michael O. Duff - 1994