|
12
|
A Causal Approach to Hierarchical Decomposition in Reinforcement Learning
– Anders Jonsson
- 2006
|
|
2
|
Automatic Induction of MAXQ Hierarchies
– Neville Mehta, Mike Wynkoop, Soumya Ray, Prasad Tadepalli, Tom Dietterich
|
|
|
Autonomous Qualitative Learning of Distinctions and Actions in a Developing Agent
– Jonathan Mugan
|
|
|
Programming and Reinforcement Learning, 105–111. IEEE. Grounding Subgoals in Information Transitions
– unknown authors
|
|
18
|
Causal Graph Based Decomposition of Factored MDPs
– Anders Jonsson , Andrew Barto
- 2006
|
|
|
Structured Exploration for Reinforcement Learning
– Nicholas Kenneth Jong
- 2010
|
|
|
and Search General Terms
– Gheorghe Comanici, Doina Precup
|
|
|
Behavioral Building Blocks FOR AUTONOMOUS AGENTS: . . .
– Özgür Simsek
- 2008
|
|
32
|
Identifying useful subgoals in reinforcement learning by local graph partitioning
– Alicia P. Wolfe, Andrew G. Barto
- 2005
|
|
32
|
Building portable options: Skill transfer in reinforcement learning
– George Konidaris, Andrew Barto
- 2007
|
|
4
|
Automated Discovery of Options in Reinforcement Learning
– Martin Stolle
- 2004
|
|
3
|
Hierarchical Reinforcement Learning: A Hybrid Approach
– Malcolm Ross Kinsella Ryan
- 2002
|
|
28
|
An Algebraic Approach to Abstraction in Reinforcement Learning
– Balaraman Ravindran, Andrew Barto
- 2003
|
|
|
Clustering via Dirichlet Process Mixture Models for Portable Skill Discovery
– Scott Niekum, Andrew G. Barto
|
|
3
|
Discovering Options from Example Trajectories
– Peng Zang, Peng Zhou, David Minnen, Charles Isbell
|
|
1
|
Hierarchical Reinforcement Learning Using Graphical Models
– Victoria Manfredi, Sridhar Mahadevan
|
|
11
|
Skill Discovery in Continuous Reinforcement Learning Domains using Skill Chaining
– George Konidaris, Andrew G. Barto
|
|
19
|
An intrinsic reward mechanism for efficient exploration
– Andrew G. Barto
- 2006
|
|
|
Automatic construction of temporally extended actions for MDPs
– Pablo Samuel Castro, Doina Precup
|