## Input/output hmms for sequence processing (1996)

Venue: | IEEE Transactions on Neural Networks |

Citations: | 103 - 12 self |

### BibTeX

@ARTICLE{Bengio96input/outputhmms,

author = {Yoshua Bengio and Paolo Frasconi},

title = {Input/output hmms for sequence processing},

journal = {IEEE Transactions on Neural Networks},

year = {1996},

volume = {7},

pages = {1231--1249}

}

### Years of Citing Articles

### OpenURL

### Abstract

We consider problems of sequence processing and propose a solution based on a discrete state model in order to represent past context. Weintroduce a recurrent connectionist architecture having a modular structure that associates a subnetwork to each state. The model has a statistical interpretation we call Input/Output Hidden Markov Model (IOHMM). It can be trained by the EM or GEM algorithms, considering state trajectories as missing data, which decouples temporal credit assignment and actual parameter estimation. The model presents similarities to hidden Markov models (HMMs), but allows us to map input se-quences to output sequences, using the same processing style as recurrent neural networks. IOHMMs are trained using a more discriminant learning paradigm than HMMs, while potentially taking advantage of the EM algorithm. We demonstrate that IOHMMs are well suited for solving grammatical inference problems on a benchmark problem. Experimental results are presented for the seven Tomita grammars, showing that these adaptive models can attain excellent generalization.