Learning without stateestimation in partially observable markovian decision processes (1994)

by Satinder P Singh, Tommi Jaakkola, Michael I Jordan
Venue:In In Proceedings of the Eleventh International Conference on Machine Learning