Modelfree reinforcement learning for non-markovian decision problems (1994)

by Satinder Pal Singh, Tommi Jaakkola, Michael I Jordan
Venue:In Proceedings of the Machine Learning Conference