Predicting Unseen Triphones With Senones (1993)
| Citations: | 37 - 9 self |
BibTeX
@INPROCEEDINGS{Hwang93predictingunseen,
author = {Mei-yuh Hwang and Xuedong Huang and Fileno Alleva},
title = {Predicting Unseen Triphones With Senones},
booktitle = {},
year = {1993},
pages = {311--314}
}
Abstract
In large-vocabulary speech recognition, the decoder often encounters triphones that are not covered in the training data. These unseen triphones are usually represented by corresponding diphones or context-independent monophones. We propose to use decision-tree-based senones to generate needed senonic baseforms for unseen triphones. A decision tree is built for each individual Markov state of each phone, and the leaves of the trees constitute the senone codebook. To find the senone a Markov state of any triphone is associated with, we traverse the corresponding tree until we reach a leaf node, where a senone is represented. We used the DARPA 5,000-word speaker-independent Wall Street Journal dictation task to evaluate the proposed method. The word error rate was reduced by 11% when unseen triphones were modeled by the decision-tree-based senones. When there were at least 5 unseen triphones in each test utterance, the error rate could be reduced by more than 20%. This research was spons...
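
The tree traversal described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The phone sets, the yes/no questions (`left_is_nasal`, `right_is_vowel`), the tree shapes, and the senone IDs are all hypothetical, chosen only to show how one tree per (phone, state) pair maps any triphone context, seen or unseen, to a leaf senone.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple, Union

# Hypothetical phone classes used by the example questions.
NASALS = {"m", "n", "ng"}
VOWELS = {"aa", "ae", "ih", "iy", "uw"}


@dataclass(frozen=True)
class Leaf:
    """A leaf of a decision tree; it holds one senone from the codebook."""
    senone_id: int


@dataclass(frozen=True)
class Node:
    """An internal node asking a yes/no question about the phonetic context."""
    question: Callable[[str, str], bool]  # (left_context, right_context) -> bool
    yes: Union["Node", Leaf]
    no: Union["Node", Leaf]


def left_is_nasal(left: str, right: str) -> bool:
    return left in NASALS


def right_is_vowel(left: str, right: str) -> bool:
    return right in VOWELS


def find_senone(tree: Union[Node, Leaf], left: str, right: str) -> int:
    """Walk the tree from the root until a leaf is reached."""
    node = tree
    while isinstance(node, Node):
        node = node.yes if node.question(left, right) else node.no
    return node.senone_id


# One tree per (phone, Markov state index); two states here for brevity.
TREES: Dict[Tuple[str, int], Union[Node, Leaf]] = {
    ("ae", 0): Node(left_is_nasal, Leaf(0), Node(right_is_vowel, Leaf(1), Leaf(2))),
    ("ae", 1): Node(right_is_vowel, Leaf(3), Leaf(4)),
}


def senonic_baseform(phone: str, left: str, right: str, n_states: int = 2) -> List[int]:
    """Return the senone sequence for a triphone, whether or not it was seen in training."""
    return [find_senone(TREES[(phone, s)], left, right) for s in range(n_states)]


if __name__ == "__main__":
    print(senonic_baseform("ae", "m", "t"))
    print(senonic_baseform("ae", "k", "iy"))
```

Because every question depends only on the left and right context, the lookup needs no training examples of the particular triphone, which is what lets the trees cover unseen triphones.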







