Automatic Discovery and Transfer of MAXQ Hierarchies

by Neville Mehta , Soumya Ray , Prasad Tadepalli , Thomas Dietterich
Citations:21 - 1 self

Documents Related by Co-Citation

3760 Reinforcement Learning I: Introduction – Richard S. Sutton, Andrew G. Barto - 1998
426 Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning – Richard Sutton, Doina Precup, Satinder Singh - 1999
367 Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition – Thomas G. Dietterich - 2000
240 Reinforcement learning with hierarchies of machines – Ronald Parr, Stuart Russell - 1998
116 Automatic Discovery of Subgoals in Reinforcement Learning using Diverse Density – Amy Mcgovern, Andrew G. Barto - 2001
81 Discovering hierarchy in reinforcement learning with hexq – Bernhard Hengst - 2002
161 Recent advances in hierarchical reinforcement learning – Andrew G. Barto - 2003
50 Identifying useful subgoals in reinforcement learning by local graph partitioning – Alicia P. Wolfe, Andrew G. Barto - 2005
122 The MAXQ Method for Hierarchical Reinforcement Learning – Thomas G. Dietterich - 1998
40 Dynamic Abstraction in Reinforcement Learning via Clustering – Shie Mannor, Ishai Menache, Amit Hoze, Uri Klein - 2004
22 Causal Graph Based Decomposition of Factored MDPs – Anders Jonsson , Andrew Barto - 2006
580 Markov decision processes – M L Puterman - 2005
102 Finding Structure in Reinforcement Learning – Sebastian Thrun, Anton Schwartz - 1995
35 Multi-task reinforcement learning: A hierarchical bayesian approach – Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepalli - 2007
1309 Learning from Delayed Rewards – C Watkins - 1989
59 Temporal Abstraction in Reinforcement Learning – Doina Precup - 2000
30 PolicyBlocks: An Algorithm for Creating Useful Macro-Actions in Reinforcement Learning – Marc Pickett, Andrew G. Barto - 2002
16 A Causal Approach to Hierarchical Decomposition in Reinforcement Learning – Anders Jonsson - 2006
226 Exploiting structure in policy construction – Craig Boutilier, Richard Dearden, Mois├ęs Goldszmidt - 1995