Learning control of finite Markov chains with explicit trade-off between estimation and control (1990)

by M Sato, K Abe, H Takeda
Venue:In Connectionist Models, Proceedings of the 1990 Summer School