State Abstraction in MAXQ Hierarchical Reinforcement Learning (2000)

Cached

Download Links

by Thomas Dietterich Department , Thomas G. Dietterich
Venue:Advances in Neural Information Processing Systems 12

Active Bibliography

367 Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition – Thomas G. Dietterich - 2000
2 A Unifying Framework for Temporal Abstraction in Stochastic Processes – Ronald Parr - 1998
– Learning Problems and Learning Algorithms – Dealing with Suboptimality – State Abstraction – Thomas G. Dietterich, Thomas G. Dietterich, Visiting Senior Scientist
59 Temporal Abstraction in Reinforcement Learning – Doina Precup - 2000
108 Hierarchical Control and Learning for Markov Decision Processes – Ronald Edward Parr - 1998
426 Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning – Richard Sutton, Doina Precup, Satinder Singh - 1999
55 Between MDPs and semi-MDPs: Learning, planning, and representing knowledge at multiple temporal scales – Richard S. Sutton, Doina Precup - 1998
1 HIERARCHICAL REINFORCEMENT LEARNING WITH FUNCTION APPROXIMATION FOR ADAPTIVE CONTROL – Margaret Mary Skelly
3 Hierarchical Reinforcement Learning: A Hybrid Approach – Malcolm Ross Kinsella Ryan - 2002
122 The MAXQ Method for Hierarchical Reinforcement Learning – Thomas G. Dietterich - 1998
19 Using Options for Knowledge Transfer in Reinforcement Learning – Theodore J. Perkins, Doina Precup - 1999
Solving Large Markov . . . – Yilan Gu - 2003
36 An Overview of MAXQ Hierarchical Reinforcement Learning – Thomas G. Dietterich - 2000
11 Algorithms for Partially Observable Markov Decision Processes – Weihong Zhang - 2001
47 Flexible Decomposition Algorithms for Weakly Coupled Markov Decision Problems – Ronald Parr - 1998
4 Learning Hierarchical Behaviors – David Andre - 1998
5 High-Level Robot Programming in Dynamic and Incompletely Known Environments – Mikhail Soutchanski - 2003
8 Event-Learning And Robust Policy Heuristics – András Lörincz, Imre Pólik, István Szita - 2001
52 Intra-option learning about temporally abstract actions – Richard S. Sutton, Doina Precup - 1998