Hierarchical reinforcement learning with the Maxq value function decomposition (2000)

by T G Dietterich
Venue:Journal of Artificial Intelligence Research