Solving Semi-Markov Decision Problems using Average Reward Reinforcement Learning (1999)

by Tapas Das, Abhijit Gosavi, Sridhar Mahadevan, N. Marchalleck.
Venue:Management Science