Results 1 - 10
of
16,638
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- ACOUSTICS, SPEECH AND SIGNAL PROCESSING, IEEE TRANSACTIONS ON
, 1980
"... Several parametric representations of the acoustic signal were compared as to word recognition performance in a syllable-oriented continuous speech recognition system. The vocabulary in-cluded many phonetically similar monosyllabic words, therefore the emphasis was on ability to retain phonetically ..."
Abstract
-
Cited by 1089 (2 self)
- Add to MetaCart
Several parametric representations of the acoustic signal were compared as to word recognition performance in a syllable-oriented continuous speech recognition system. The vocabulary in-cluded many phonetically similar monosyllabic words, therefore the emphasis was on ability to retain phonetically significant acoustic information in the face of syntactic and duration variations. For each ~ arameter set (based on a mel-frequency cepstrum, a linear frequency cepstrum, a linear prediction cepstrum, a linear predic-tion spectrum, or a set of reflection coefficients), word templates were generated using an efficient dynamic method, and test data were time registered wi th the templates. A set of ten mel-frequency cepstrum coefficients computed every 6 " 4 ms resulted in the best performance, namely 96.. 5 % and 9500 % recognition with each of two speakers.. The superior performance of the mel-frequency cepstrum coefficients may be attributed to the fact that they better represent the perceptually relevant aspects of the short-term speech spectrum.
Understanding Normal and Impaired Word Reading: Computational Principles in Quasi-Regular Domains
- PSYCHOLOGICAL REVIEW
, 1996
"... We develop a connectionist approach to processing in quasi-regular domains, as exemplified by English word reading. A consideration of the shortcomings of a previous implementation (Seidenberg & McClelland, 1989, Psych. Rev.) in reading nonwords leads to the development of orthographic and phono ..."
Abstract
-
Cited by 583 (94 self)
- Add to MetaCart
in subsequent simulations, including an attractor network that reproduces the naming latency data directly in its time to settle on a response. Further analyses of the network's ability to reproduce data on impaired reading in surface dyslexia support a view of the reading system that incorporates a graded
Relative Income, Happiness and Utility: An Explanation for the Easterlin Paradox and Other Puzzles
, 2007
"... ..."
Scientific workflow management and the Kepler system. Special issue: workflow in grid systems
- Concurr. Comput.: Pract. Exp
, 2006
"... Many scientific disciplines are now data and information driven, and new scientific knowledge is often gained by scientists putting together data analysis and knowledge discovery “pipelines”. A related trend is that more and more scientific communities realize the benefits of sharing their data and ..."
Abstract
-
Cited by 285 (21 self)
- Add to MetaCart
and computational services, and are thus contributing to a distributed data and computational community infrastructure (a.k.a. “the Grid”). However, this infrastructure is only a means to an end and scientists ideally should be bothered little with its existence. The goal is for scientists to focus on development
An Investigation of the Therac-25 Accidents
- IEEE Computer
, 1993
"... A thorough account of the Therac-25 medical electron accelerator accidents reveals previously unknown details and suggests ways to reduce risk in the future. omputers are increasingly being introduced into safety-critical systems and, as a consequence, have been involved in accidents. Some of the mo ..."
Abstract
-
Cited by 271 (2 self)
- Add to MetaCart
A thorough account of the Therac-25 medical electron accelerator accidents reveals previously unknown details and suggests ways to reduce risk in the future. omputers are increasingly being introduced into safety-critical systems and, as a consequence, have been involved in accidents. Some of the most widely cited software-related accidents in safety-critical systems involved a computerized radiation therapy machine called the Therac-25. Between June 1985 and January 1987, six known accidents involved massive overdoses by the Therac-25-with resultant deaths and serious injuries. They have been described as the worst series of radiation accidents in the 35year history of medical acceler-
Der Bundesverband Sicherheitspolitik an Hochschulen
"... (BSH) ist der Dachverband sicherheitspolitischer ..."
PRIVATE HOCHSCHULEN – PRIVATE HIGHER EDUCATION
"... die hochschule. journal für wissenschaft und bildung ..."
Recent advances in the automatic recognition of audiovisual speech
- Proceedings of the IEEE
"... Abstract — Visual speech information from the speaker’s mouth region has been successfully shown to improve noise robustness of automatic speech recognizers, thus promising to extend their usability into the human computer interface. In this paper, we review the main components of audio-visual autom ..."
Abstract
-
Cited by 169 (16 self)
- Add to MetaCart
Abstract — Visual speech information from the speaker’s mouth region has been successfully shown to improve noise robustness of automatic speech recognizers, thus promising to extend their usability into the human computer interface. In this paper, we review the main components of audio-visual automatic speech recognition and present novel contributions in two main areas: First, the visual front end design, based on a cascade of linear image transforms of an appropriate video region-of-interest, and subsequently, audio-visual speech integration. On the later topic, we discuss new work on feature and decision fusion combination, the modeling of audio-visual speech asynchrony, and incorporating modality reliability estimates to the bimodal recognition process. We also briefly touch upon the issue of audio-visual speaker adaptation. We apply our algorithms to three multi-subject bimodal databases, ranging from small- to large-vocabulary recognition tasks, recorded at both visually controlled and challenging environments. Our experiments demonstrate that the visual modality improves automatic speech recognition over all conditions and data considered, however less so for visually challenging environments and large vocabulary tasks. Index Terms — Audio-visual speech recognition, speechreading, visual feature extraction, audio-visual fusion, hidden Markov model, multi-stream HMM, product HMM, reliability estimation, adaptation, audio-visual databases. I.
Results 1 - 10
of
16,638