AN SVM FRONT-END LANDMARK SPEECH RECOGNITION SYSTEM (2008)
| Citations: | 3 - 1 self |
BibTeX
@MISC{Borys08ansvm,
author = {Sarah E. Borys},
title = {AN SVM FRONT-END LANDMARK SPEECH RECOGNITION SYSTEM },
year = {2008}
}
OpenURL
Abstract
Support vector machines (SVMs) can be trained to detect manner transitions between phones and to identify the manner and place of articulation of any given phone. The SVMs can perform these tasks with high accuracy using a variety of acoustic representations. The SVMs generalize well to unseen test data if these data were created under identical conditions to the training corpus. Unseen acoustic data from different corpora present a problem for the SVM, even if these acoustic data were generated under similar conditions. The discriminant outputs of these SVMs are used to create both a hybrid SVM/HMM (hidden Markov model) phone recogni-tion system and a hybrid SVM/HMM word recognition system. There is a significant improvement in both phone and word recognition accuracy when these SVM discrim-inant features are used instead of mel frequency cepstral coefficients (MFCCs).







