Learning without state-estimation in partially observable markovian decision processes (1994)

by S P Singh, T Jaakkola, M I Jordan
Venue:In Proceedings of the Eleventh International Conference on Machine Learning