• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 11,504
Next 10 →

Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences

by Steven B. Davis, Paul Mermelstein - ACOUSTICS, SPEECH AND SIGNAL PROCESSING, IEEE TRANSACTIONS ON , 1980
"... Several parametric representations of the acoustic signal were compared as to word recognition performance in a syllable-oriented continuous speech recognition system. The vocabulary in-cluded many phonetically similar monosyllabic words, therefore the emphasis was on ability to retain phonetically ..."
Abstract - Cited by 1120 (2 self) - Add to MetaCart
phonetically significant acoustic information in the face of syntactic and duration variations. For each ~ arameter set (based on a mel-frequency cepstrum, a linear frequency cepstrum, a linear prediction cepstrum, a linear predic-tion spectrum, or a set of reflection coefficients), word templates were

Speaker, Environment And Channel Change Detection And Clustering Via The Bayesian Information Criterion

by Scott Shaobing Chen, P. S. Gopalakrishnan , 1998
"... In this paper, we are interested in detecting changes in speaker identity, environmental condition and channel condition; we call this the problem of acoustic change detection. The input audio stream can be modeled as a Gaussian process in the cepstral space. We present a maximum likelihood approach ..."
Abstract - Cited by 272 (2 self) - Add to MetaCart
In this paper, we are interested in detecting changes in speaker identity, environmental condition and channel condition; we call this the problem of acoustic change detection. The input audio stream can be modeled as a Gaussian process in the cepstral space. We present a maximum likelihood

The HEARSAY-II speech understanding system: Integrating knowledge to resolve uncertainty

by Lee D. Erman, Victor R. Lesser - Computing Surveys , 1980
"... The Hearsay-II system, developed during the DARPA-sponsored five-year speech-understanding research program, represents both a specific solution to the speech-understanding problem and a general framework for coordinating independent processes to achieve cooperative problem-solving behavior. As a co ..."
Abstract - Cited by 392 (23 self) - Add to MetaCart
computational problem, speech understanding reflects a large number of intrinsically interesting issues. Spoken sounds are achieved by a long chain of successive transformations, from intentions, through semantm and syntactic structurmg, to the eventually resulting audible acoustic waves. As a consequence

The motor theory of speech perception revised

by Alvin M. Liberman, Ignatius G. Mattingly - Cognition , 1985
"... A motor theory of speech perception, initially proposed to account for results of early experiments with synthetic speech, is now extensively revised to accommodate recent findings, and to relate the assumptions of the theory to those that might be made about other perceptual modes. According to the ..."
Abstract - Cited by 372 (2 self) - Add to MetaCart
to the revised theory, phonetic information is perceived in a biologically distinct system, a ‘module ’ specialized to detect the intended gestures of the speaker that are the basis for phonetic categories. Built into the structure of this module is the unique but lawful relationship between the gestures

A Compact Model for Speaker-Adaptive Training

by Tasos Anastasakos, John Mcdonough, Richard Schwartz, John Makhoul - in Proc. ICSLP , 1996
"... In this work we formulate a novel approach to estimating the parameters of continuous density HMMs for speaker-independent (SI) continuous speech recognition. It is motivated by the fact that variability in SI acoustic models is attributed to both phonetic variation and variation among the speakers ..."
Abstract - Cited by 208 (20 self) - Add to MetaCart
of the training population, that is independent of the information content of the speech signal. These two variation sources are decoupled and the proposed method jointly annihilates the inter-speaker variation and estimates the HMM parameters of the SI acoustic models.

Acoustic-Labial Speaker Verification

by P. Jourlin, J. Lüttin, D. Genoud, H. Wassner , 1997
"... This paper describes a multimodal approach for speaker verification. The system consists of two classifiers, one using visual features, the other using acoustic features. A lip tracker is used to extract visual information from the speaking face which provides shape and intensity features. We descri ..."
Abstract - Cited by 38 (2 self) - Add to MetaCart
This paper describes a multimodal approach for speaker verification. The system consists of two classifiers, one using visual features, the other using acoustic features. A lip tracker is used to extract visual information from the speaking face which provides shape and intensity features. We

Locating the Nodes -- Cooperative localization in wireless sensor networks

by Neal Patwari, Joshua N. Ash, Spyros Kyperountas, Alfred O. Hero III, Randolph L. Moses, et al. , 2005
"... Accurate and low-cost sensor localization is a critical requirement for the deployment of wireless sensor networks in a wide variety of applications. Low-power wireless sensors may be many hops away from any other sensors with a priori location information. In cooperative localization, sensors work ..."
Abstract - Cited by 305 (12 self) - Add to MetaCart
Accurate and low-cost sensor localization is a critical requirement for the deployment of wireless sensor networks in a wide variety of applications. Low-power wireless sensors may be many hops away from any other sensors with a priori location information. In cooperative localization, sensors work

Acoustic beamforming for speaker diarization of meetings

by Xavier Anguera, Chuck Wooters, Javier Hern - IEEE TASLP , 2007
"... Abstract—When performing speaker diarization on recordings from meetings, multiple microphones of different qualities are usually available and distributed around the meeting room. Although several approaches have been proposed in recent years to take advantage of multiple microphones, they are eith ..."
Abstract - Cited by 38 (3 self) - Add to MetaCart
information. Tests on speaker diarization show a 25 % relative improvement on the test set compared to using a single most centrally located microphone. Additional experimental results show improvements using these techniques in a speech recognition task. Index Terms—acoustic beamforming, speaker diarization

Conceptual pacts and lexical choice in conversation

by Susan E. Brennan, Herbert H. Clark - Journal of Experimental Psychology: Learning, Memory, and Cognition , 1996
"... When people in conversation refer repeatedly to the same object, they come to use the same terms. This phenomenon, called lexical entrainment, has several possible explanations. Ahistorical accounts appeal only to the informativeness and availability of terms and to the current salience of the objec ..."
Abstract - Cited by 253 (16 self) - Add to MetaCart
When people in conversation refer repeatedly to the same object, they come to use the same terms. This phenomenon, called lexical entrainment, has several possible explanations. Ahistorical accounts appeal only to the informativeness and availability of terms and to the current salience

Speaker verification based on fusion of acoustic and articulatory information

by Ming Li, Jangwon Kim, Prasanta Ghosh, Vikram Ramanarayanan, Shrikanth Narayanan - in Proceedings of Interspeech, 2013
"... • Motivation of using articulatory information for speaker ID • Speaker Id based on acoustic-to-articulatory inversion features • Methods ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
• Motivation of using articulatory information for speaker ID • Speaker Id based on acoustic-to-articulatory inversion features • Methods
Next 10 →
Results 1 - 10 of 11,504
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University