Learning without stateestimation in partially observable Markovian decision processes (1994)

by S P Singh, T Jaakkola, M I Jordan
Venue:in Proceedings of the eleventh international conference on machine learning