Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning (1999)

Cached

Download Links

by Richard Sutton , Doina Precup , Satinder Singh
Venue:Artificial Intelligence
Citations:342 - 22 self

Documents Related by Co-Citation

212 Reinforcement learning with hierarchies of machines – Ronald Parr, Stuart Russell - 1998
2829 Reinforcement Learning I: Introduction – Richard S. Sutton, Andrew G. Barto - 1998
307 Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition – Thomas G. Dietterich - 2000
98 Hierarchical Control and Learning for Markov Decision Processes – Ronald Edward Parr - 1998
256 Self-improving reactive agents based on reinforcement learning, planning and teaching – Long-ji Lin - 1992
9 Temporal abstraction in reinforcement learning. Doctoral dissertation – D Precup - 2000
65 Discovering Hierarchy in Reinforcement Learning with HEXQ – Bernhard Hengst - 2002
1134 Reinforcement learning: a survey – Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore - 1996
102 The MAXQ Method for Hierarchical Reinforcement Learning – Thomas G. Dietterich - 1998
222 Motivated Reinforcement Learning – Peter Dayan - 2001
94 A heuristic approach to the discovery of macro-operators – Glenn A Iba - 1989
87 Programmable reinforcement learning agents – David Andre, Stuart J. Russell - 2001
226 On-Line Q-Learning Using Connectionist Systems – G. A. Rummery, M. Niranjan - 1994
99 Reinforcement Learning Methods for Continuous-Time Markov Decision Problems – Steven J. Bradtke, Michael O. Duff - 1994
396 A model for reasoning about persistence and causation – Thomas Dean, Keiji Kanazawa - 1990
43 Learning hierarchical control structures for multiple tasks and changing environments – B L Digney - 1998
134 Policy invariance under reward transformations: Theory and application to reward shaping – Andrew Y. Ng, Daishi Harada, Stuart Russell - 1999
42 Autonomous Discovery Of Temporal Abstractions From Interaction With An Environment – Elizabeth Amy Mcgovern, Neil E. Berthier, Roderic A. Grupen, J. Eliot, B. Moss, Elizabeth Amy Mcgovern, W. Bruce Croft, Department Chair - 2002
4071 C4.5: Programs for machine learning – J Quinlan - 1993