Learning control of finite Markov chains with explicit trade-off between estimation and control (1988)

by M Sato, K &Takeda Abe, H
Venue:IEEE Transactions on Systems, Man and Cybernetics