Dyna, an Integrated Architecture for Learning, Planning, and Reacting (1991)

Cached

Download Links

by Richard S. Sutton
Venue:WORKING NOTES OF THE 1991 AAAI SPRING SYMPOSIUM
Citations:473 - 18 self

Documents Related by Co-Citation

1309 Learning from Delayed Rewards – C Watkins - 1989
1226 Learning to predict by the methods of temporal differences – Richard S. Sutton - 1988
304 Learning in Embeded Systems – L P Kaelbling - 1993
2593 On the theory of dynamic programming – Richard E Bellman - 1952
242 Temporal credit assignment in reinforcement learning. Doctoral dissertation – R S Sutton - 1984
334 Automatic Programming of Behavior-based Robots using Reinforcement Learning – S. Mahadevan, J. Connell, C. Sammut, R. Sutton, Temporal Phd - 1991
78 Variable resolution dynamic programming: Efficiently learning action maps in multivariate real-valued state-spaces – Andrew Moore - 1991
175 A survey of algorithmic methods for partially observable Markov decision processes – W S Lovejoy - 1991
513 Dynamic Programming and Markov Processes – R A Howard - 1960
195 Learning and Sequential Decision Making – Andrew G. Barto, R. S. Sutton, C. J. C. H. Watkins - 1989
92 Efficient Learning and Planning Within the Dyna Framework – Jing Peng, Ronald J. Williams - 1993
527 Learning to act using real-time dynamic programming – Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh - 1993
612 Some studies in machine learning using the game of Checkers – Arthur L. Samuel - 1959
109 Real-time Learning and Control Using Asynchronous Dynamic Programming – A G Barto, S J Bradtke, S P Singh - 1995
64 Programming robots using reinforcement learning and teaching – L J Lin - 1991
191 A survey of partially observable Markov decision processes: Theory, models, and algorithms – G E Monahan - 1982
295 Forward models: Supervised learning with a distal teacher – Michael I. Jordan, David E. Rumelhart - 1992
133 Input generalization in delayed reinforcement learning: An algorithm and performance comparisons – David Chapman, Leslie Pack Kaelbling - 1991
60 Planning by Incremental Dynamic Programming – Richard S. Sutton - 1991