Dyna, an Integrated Architecture for Learning, Planning, and Reacting (1991)

Cached

Download Links

by Richard S. Sutton
Venue:WORKING NOTES OF THE 1991 AAAI SPRING SYMPOSIUM
Citations:472 - 18 self

Documents Related by Co-Citation

1308 Learning from delayed rewards – Christopher J C H Watkins - 1989
1222 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
303 Learning in Embedded Systems – L P Kaelbling - 1993
2587 Dynamic Programming – R E Bellman - 1957
242 Temporal Credit Assignment in Reinforcement Learning – R Sutton - 1984
334 Automatic Programming of Behavior-based Robots using Reinforcement Learning – S. Mahadevan, J. Connell, C. Sammut, R. Sutton, Temporal Phd - 1991
78 Variable resolution dynamic programming: Efficiently learning action maps in multivariate real-valued state-spaces – Andrew Moore - 1991
175 A Survey of Algorithmic Methods for Partially Observed Markov Decision Processes – W S Lovejoy - 1991
513 Dynamic Programming and Markov Processes – R A Howard - 1960
195 Learning and Sequential Decision Making – Andrew G. Barto, R. S. Sutton, C. J. C. H. Watkins - 1989
92 Efficient Learning and Planning Within the Dyna Framework – Jing Peng, Ronald J. Williams - 1993
526 Learning to act using real-time dynamic programming – Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh - 1993
610 Some studies in machine learning using the game of Checkers – Arthur L. Samuel - 1959
109 Real-time learning and control using asynchronous dynamic programming (Technical Report 91-57 – A G Bmdtke Barto, S J, S P Singh - 1991
64 Programming robots using reinforcement learning and teaching – L J Lin - 1991
191 A survey of partially observable Markov decision processes: Theory, models, and algorithms – G E Monahan - 1982
295 Forward models: Supervised learning with a distal teacher – Michael I. Jordan, David E. Rumelhart - 1992
133 Input generalization in delayed reinforcement learning: An algorithm and performance comparisons – David Chapman, Leslie Pack Kaelbling - 1991
60 Planning by Incremental Dynamic Programming – Richard S. Sutton - 1991