Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition (2000)

by Thomas G. Dietterich
Venue: Journal of Artificial Intelligence Research
Citations: 370 (6 self)