Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning (1999)

by Richard Sutton, Doina Precup, Satinder Singh
Venue:Artificial Intelligence