Hierarchical reinforcement learning with the MAXQ value function decomposition (2000)

by T Dietterich
Venue: Journal of Artificial Intelligence Research