A Public Domain Decoder For Large Vocabulary Conversational Speech Recognition
1999
The high cost of the infrastructure required to conduct state-of-the-art speech recognition research prevents many small research groups from evaluating new ideas on large-scale tasks. To overcome this barrier, we are developing an Internet-based speech-to-text (STT) toolkit. In this paper, we present the core component of this system: a decoder that uses a one-pass time-synchronous Viterbi-based search algorithm called trace projection. This decoder can support efficient lattice rescoring using cross-word triphones, lexical trees and n-gram grammars. The decoder performance in terms of CPU and memory usage is on par with commercial systems of its kind. Preliminary evaluations on the SWITCHBOARD (SWB) corpus have yielded a word error rate of 39%.

1. INTRODUCTION

A speech-to-text (STT) system conceptually consists of three subsystems: an acoustic processor which converts the speech signal into a sequence of feature vectors modeled using Hidden Markov Models (HMMs); a linguistic proc...
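The word error rate quoted in the abstract above is conventionally computed as the word-level edit distance (substitutions, deletions and insertions) between the recognizer output and the reference transcript, divided by the number of reference words. The following is a minimal sketch of that standard definition. The function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration, not the scoring tool used in the paper.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper: tokenization is plain whitespace splitting,
    which is an assumption, not the paper's scoring procedure.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)
```

For example, `word_error_rate("the cat sat on the mat", "the cat sit on mat")` counts one substitution and one deletion against six reference words. Because insertions are counted, the value can exceed 1.0 for very poor hypotheses.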
The Time-Conditioned Approach in Dynamic Programming Search for LVCSR
Abstract: This paper presents the time-conditioned approach in dynamic programming search for large-vocabulary continuous speech recognition. The following topics are presented: the baseline algorithm, a time-synchronous beam search version, a comparison with the word-conditioned approach, and a comparison with stack decoding. The approach has been successfully tested on the NAB task using a vocabulary of 64 000 words.

Index Terms: Beam search, dynamic programming, large vocabulary speech recognition, one-pass DP search, search organization, time-conditioned DP search.
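Both abstracts above rely on time-synchronous beam search: all hypotheses are extended by one frame at a time, and any hypothesis scoring too far below the current best is pruned. The sketch below shows that general idea on a tiny discrete HMM. It is not the time-conditioned search organization of this paper, nor the trace projection algorithm of the previous entry. The names `viterbi_beam` and `prune`, the `beam` parameter, and all model numbers are hypothetical and chosen only for illustration.

```python
import math


def prune(hyps, beam):
    """Keep hypotheses whose log score is within `beam` of the best one."""
    best = max(score for score, _ in hyps.values())
    return {s: sp for s, sp in hyps.items() if sp[0] >= best - beam}


def viterbi_beam(obs, states, log_init, log_trans, log_emit, beam=10.0):
    """Return (best_log_score, best_state_path) for an observation sequence."""
    # Active hypotheses at the current frame: state -> (log score, path).
    active = {s: (log_init[s] + log_emit[s][obs[0]], [s]) for s in states}
    active = prune(active, beam)

    for o in obs[1:]:  # advance every surviving hypothesis by one frame
        nxt = {}
        for prev_state, (score, path) in active.items():
            for s in states:
                cand = score + log_trans[prev_state][s] + log_emit[s][o]
                # Viterbi recombination: keep only the best path into s.
                if s not in nxt or cand > nxt[s][0]:
                    nxt[s] = (cand, path + [s])
        active = prune(nxt, beam)

    return max(active.values(), key=lambda sp: sp[0])


def logs(table):
    """Convert a nested dict of probabilities to log probabilities."""
    return {k: (logs(v) if isinstance(v, dict) else math.log(v))
            for k, v in table.items()}


# Illustrative toy model (assumed numbers, not taken from either paper).
STATES = ["Healthy", "Fever"]
LOG_INIT = logs({"Healthy": 0.6, "Fever": 0.4})
LOG_TRANS = logs({"Healthy": {"Healthy": 0.7, "Fever": 0.3},
                  "Fever": {"Healthy": 0.4, "Fever": 0.6}})
LOG_EMIT = logs({"Healthy": {"normal": 0.5, "cold": 0.4, "dizzy": 0.1},
                 "Fever": {"normal": 0.1, "cold": 0.3, "dizzy": 0.6}})

score, path = viterbi_beam(["normal", "cold", "dizzy"],
                           STATES, LOG_INIT, LOG_TRANS, LOG_EMIT)
```

A narrower `beam` reduces the number of active hypotheses per frame at the risk of pruning the path that would eventually have scored best. Large-vocabulary decoders face the same trade-off at far greater scale.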

