Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition (2000)

by Thomas G. Dietterich
Venue: Journal of Artificial Intelligence Research
Citations: 367 - 6 self