Learning policies for partially observable environments: Scaling up (1995)

by M L Littman, A R Cassandra, L P Kaelbling
Venue:In ICML’95