Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning (1999)

by Richard S Sutton, Doina Precup, Satinder Singh
Venue:Artificial Intelligence