• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 390
Next 10 →

Speaker verification using Adapted Gaussian mixture models

by Douglas A. Reynolds, Thomas F. Quatieri, Robert B. Dunn - Digital Signal Processing , 2000
"... In this paper we describe the major elements of MIT Lincoln Laboratory’s Gaussian mixture model (GMM)-based speaker verification system used successfully in several NIST Speaker Recognition Evaluations (SREs). The system is built around the likelihood ratio test for verification, using simple but ef ..."
Abstract - Cited by 1010 (42 self) - Add to MetaCart
is also described and discussed. Finally, representative performance benchmarks and system behavior experiments on NIST SRE corpora are presented. © 2000 Academic Press Key Words: speaker recognition; Gaussian mixture models; likelihood ratio detector; universal background model; handset normalization

recognition: the case of NIST-SRE

by Dayana Ribas, Emmanuel Vincent, Jose ́ Ramon Calvo, Dayana Ribas, Emmanuel Vincent, Jose Ramon, Calvo Uncertainty, Dayana Ribas, Emmanuel Vincent, Jose ́ Ramón Calvo , 2015
"... Uncertainty propagation for noise robust speaker ..."
Abstract - Add to MetaCart
Uncertainty propagation for noise robust speaker

BUT system description: NIST SRE 2008

by Valiantsina Hubeika, Marcel Kockmann, Petr Schwarz, Jan “honza ˇcernocký
"... BUT submitted three systems to NIST SRE 2008 eval-uations, only to the short2-short3 condition. The pri-mary system is a fusion of three sub-systems: 2 based on MFCC and factor analysis and one making use of SVM ..."
Abstract - Add to MetaCart
BUT submitted three systems to NIST SRE 2008 eval-uations, only to the short2-short3 condition. The pri-mary system is a fusion of three sub-systems: 2 based on MFCC and factor analysis and one making use of SVM

Speaker Cluster based GMM Tokeni

by Bin Ma, Donglai Zhu, Rong T, Heng Mui Keng Terrace
"... We present a speaker recognition system with multiple GMM tokenizers as the front-end, and vector space modeling as the back-end classifier. GMM tokenizer captures the acoustic and phonetic characteristics of a speaker from the speech without the need of phonetic transcription. To enhance the speake ..."
Abstract - Add to MetaCart
-gram probabilities. Multiple vectors are concatenated to form a composite vector. Finally the Support Vector Machine (SVM) is used as the back-end classifier of the composite vectors. We use the 2002 NIST Speaker Recognition Evaluation (SRE) corpus for training GMM tokenizers and background modeling, and evaluate

The NIST Meeting Room Pilot Corpus

by John S. Garofolo, Christophe D. Laprun, Martial Michel, Vincent M. Stanford, Elham Tabassi - In: Proc. of Language Resource and Evaluation Conference , 2004
"... One of the next big challenges in Automatic Speech Recognition (ASR) is the transcription of speech in meetings. This task is particularly problematic for current recognition technologies because, in most realistic meeting scenarios, the vocabularies are unconstrained, the speech is spontaneous and ..."
Abstract - Cited by 27 (0 self) - Add to MetaCart
and often overlapping, and the microphones are inconspicuously placed. To support the development of meeting recognition technologies by both the speech recognition and video extraction research communities, NIST is providing a development and evaluation infrastructure including: a multi-media corpus

SVM based speaker verification using a GMM supervector kernel and NAP variability compensation

by W. M. Campbell, D. E. Sturim, D. A. Reynolds, A. Solomonoff - in Proceedings of ICASSP, 2006
"... Gaussian mixture models with universal backgrounds (UBMs) have become the standard method for speaker recognition. Typically, a speaker model is constructed by MAP adaptation of the means of the UBM. A GMM supervector is constructed by stacking the means of the adapted mixture components. A recent d ..."
Abstract - Cited by 161 (16 self) - Add to MetaCart
between the method of SVM nuisance attribute projection (NAP) and the recent results in latent factor analysis. Experiments on a NIST SRE 2005 corpus demonstrate the effectiveness of the new technique. 1.

Variational bayes logistic regression as regularized fusion for NIST sre 2010

by Kong Aik Lee, Anthony Larcher, Tomi Kinnunen, Bin Ma, Haizhou Li - in Proc. Odyssey: the Speaker and Language Recognition Workshop , 2012
"... Fusion of the base classifiers is seen as a way to achieve high performance in state-of-the-art speaker verification systems. Typically, we are looking for base classifiers that would be com-plementary. We might also be interested in reinforcing good base classifiers by including others that are sim ..."
Abstract - Cited by 10 (8 self) - Add to MetaCart
Fusion of the base classifiers is seen as a way to achieve high performance in state-of-the-art speaker verification systems. Typically, we are looking for base classifiers that would be com-plementary. We might also be interested in reinforcing good base classifiers by including others that are similar to them. In any case, the final ensemble size is typically small and has to be formed based on some rules of thumb. We are interested to find out a subset of classifiers that has a good generalization perfor-mance. We approach the problem from sparse learning point of view. We assume that the true, but unknown, fusion weights are sparse. As a practical solution, we regularize weighted logistic regression loss function by elastic-net and LASSO constraints. However, all regularization methods have an additional param-eter that controls the amount of regularization employed. This needs to be separately tuned. In this work, we use variational Bayes approach to automatically obtain sparse solutions without additional cross-validation. Variational Bayes method improves the baseline method in 3 out of 4 sub-conditions. Index Terms: logistic regression, regularization, compressed sensing, linear fusion, speaker verification

Leeuwen, “The radboud university nijmegen submission to nist sre 2012

by Rahim Saeidi, David A. Van Leeuwen - in NIST Speaker Recognition Evaluation Workshop , 2012
"... For the participation in SRE 2012 the Radboud University Nijmegen (RUN) team was part of a larger effort “I4U, ” to which we contributed development test segment lists and score matrices used for fusion in the I4U submission. This system description is about the RUN system, for which the scores have ..."
Abstract - Cited by 7 (6 self) - Add to MetaCart
For the participation in SRE 2012 the Radboud University Nijmegen (RUN) team was part of a larger effort “I4U, ” to which we contributed development test segment lists and score matrices used for fusion in the I4U submission. This system description is about the RUN system, for which the scores

Comparison of Input and Feature Space Nonlinear Kernel Nuisance Attribute Projections for Speaker Verification

by Xianyu Zhao, Yuan Dong, Jian Zhao, Liang Lu, Jiqing Liu, Haila Wang
"... Nuisance attribute projection (NAP) was an effective method to reduce session variability in SVM-based speaker verification systems. As the expanded feature space of nonlinear kernels is usually high or infinite dimensional, it is difficult to find nuisance directions via conventional eigenvalue ana ..."
Abstract - Add to MetaCart
approach, a gradient descent algorithm is proposed to find out projection over input variables. Experimental results on the 2006 NIST SRE corpus show that both kinds of NAP can reduce unwanted variability in nonlinear kernels to improve verification performance; and NAP performed in expanded feature space

Glottal Source Cepstrum Coefficients Applied To NIST SRE 2010

by L M Mazaira , A Álvarez , P Gómez , R Martínez , C Muñoz
"... Abstract. Through the present paper, a novel feature set for speaker recognition based on glottal estimate information is presented. An iterative algorithm is used to derive the vocal tract and glottal source estimations from speech signal. In order to test the importance of glottal source informat ..."
Abstract - Add to MetaCart
information in speaker characterization, the novel feature set has been tested in the 2010 NIST Speaker Recognition Evaluation (NIST SRE10). The proposed system uses glottal estimate parameter templates and classical cepstral information to build a model for each speaker involved in the recognition process
Next 10 →
Results 1 - 10 of 390
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University