Results 1 -
9 of
9
Robust Endpoint Detection and Energy Normalization for RealTime Speech and Speaker Recognition.
- IEEE Transactions on Speech and Audio Processing,
, 2002
"... ..."
(Show Context)
Automatic verbal information verification for user authentication
- IEEE Transactions on Speech and Audio Processing
"... Abstract—Traditional speaker authentication focuses on speaker verification (SV) and speaker identification, which is accomplished by matching the speaker’s voice with his or her registered speech patterns. In this paper, we propose a new technique, verbal information verification (VIV), in which sp ..."
Abstract
-
Cited by 21 (3 self)
- Add to MetaCart
Abstract—Traditional speaker authentication focuses on speaker verification (SV) and speaker identification, which is accomplished by matching the speaker’s voice with his or her registered speech patterns. In this paper, we propose a new technique, verbal information verification (VIV), in which spoken utterances of a claimed speaker are verified against the key (usu-ally confidential) information in the speaker’s registered profile automatically to decide whether the claimed identity should be accepted or rejected. Using the proposed sequential procedure involving three question-response turns, we achieved an error-free result in a telephone speaker authentication experiment with 100 speakers. We further propose a speaker authentication system by com-bining VIV with SV. In the system, a user is verified by VIV in the first four to five accesses, usually from different acoustic environments. During these uses, one of the key questions pertains to a pass-phrase for SV. The VIV system collects and verifies the pass-phrase utterance for use as training data for speaker model construction. After a speaker-dependent model is constructed, the system then migrates to SV. This approach avoids the incon-venience of a formal enrollment procedure, ensures the quality of the training data for SV, and mitigates the mismatch caused by different acoustic environments between training and testing. Experiments showed that the proposed system improved the SV performance by over 40 % in equal-error rate compared to a conventional SV system. Index Terms—Speaker authentication, speaker recognition, speaker verification, utterance verification, verbal information verification. I.
Guidelines for experiments on the POLYCOST database
- in Proceedings of a COST 250 workshop on Application of Speaker Recognition Techniques in Telephony
, 1997
"... The purpose of this document is to define a common ground for speaker recognition experiments on the POLYCOST database. It is done by defining a set of baseline experiments for which results always should be included when presenting evaluations made on this database. By including these results and b ..."
Abstract
-
Cited by 14 (3 self)
- Add to MetaCart
(Show Context)
The purpose of this document is to define a common ground for speaker recognition experiments on the POLYCOST database. It is done by defining a set of baseline experiments for which results always should be included when presenting evaluations made on this database. By including these results and by presenting the differences introduced in new experiments, a comparison between systems tested on different sites is made possible. Four baseline experiments are defined: text-dependent speaker verification (SV) on fixed password sentence, text-prompted SV on digit sequence, text-independent SV on free speech in subject's mother tongue and finally text-independent speaker identification on the same free speech. The definition of the baseline experiment includes the definition of client and impostor speakers and speakers for training a world model; sessions for enrollment and test; which speech items to use and how to compute and present results. 1. Introduction The purpose of this documen...
Fabbrizio, “Intelligent virtual agents for contact center automation
- in IEEE Signal Processing Magazine, Volume 22, Number 5
, 2005
"... [A human-machine communication system for next-generation contact centers] ..."
Abstract
-
Cited by 14 (1 self)
- Add to MetaCart
(Show Context)
[A human-machine communication system for next-generation contact centers]
Background model design for flexible and portable speaker verification systems
- In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing
, 1999
"... ..."
(Show Context)
ABSTRACT OF THE THESIS Combining Speech Recognition and Speaker Verification
, 2008
"... and approved by ..."
(Show Context)
Automatic Verbal Information Verification for User Authentication
, 2016
"... Automatic verbal information verification for ..."
Stockholm
"... The aim of this report was to implement a text-dependent speaker verification system using speaker adapted neural networks and to evaluate the system. The idea was to use a hybrid HMM/ANN approach, i.e. Artificial Neural Networks were used to estimate Hidden Markov Model emission posterior probabili ..."
Abstract
- Add to MetaCart
(Show Context)
The aim of this report was to implement a text-dependent speaker verification system using speaker adapted neural networks and to evaluate the system. The idea was to use a hybrid HMM/ANN approach, i.e. Artificial Neural Networks were used to estimate Hidden Markov Model emission posterior probabilities from speech data, and the system was implemented in C++ as a module for GIVES. The report also contains an overview over speaker verification. Methods and algorithms for network training and adaptation are explained, and the performance of the system is tested. Both Multi-Layer perceptrons and Single-Layer perceptrons are tested and compared to other speaker verification systems. The test results show that the hybrid HMM/ANN system does not perform as well as other speaker verification systems, but if the system parameters are optimised further performance might increase. Along with an analysis and summary of the project possible improvements of the system are suggested. Sammanfattning Målet med denna rapport var att implementera ett textberoende talarverifieringssystem med
Speech and Language Processing
"... over the Web [Changing the way people communicate and access information] © IMAGESTATE Over the past decade, the World Wide Web (WWW) has been evolving into a central communication hub for consumers and businesses to efficiently access and deliver multimedia information containing text, speech, grap ..."
Abstract
- Add to MetaCart
(Show Context)
over the Web [Changing the way people communicate and access information] © IMAGESTATE Over the past decade, the World Wide Web (WWW) has been evolving into a central communication hub for consumers and businesses to efficiently access and deliver multimedia information containing text, speech, graphics, audio, or video. In this booming era of the Internet, communication is evolving at an extraordinary pace, changing from voice over traditional landline phones to multimedia data across multiple mobile devices, services, and networks. Technological breakthroughs, which are making people communicate more seamlessly and acquire information more efficiently, are revolutionizing the fields of speech and language processing and providing new research challenges and lucrative business opportunities in areas of communication, entertainment, and marketing. Figure 1 shows a sample of Web-based applications that are benefiting from the Internet revolution as well as from advances made in mobile devices. The use of multimodal user interfaces and multimedia outputs continue to play a role in the evolution of the Internet, transforming traditional business applications such as customer care and security, and promoting newer applications such as information search and mining. As the content and usage of the Web continues to grow, the need for accurate systems to locate or extract meaningful and actionable information will continue to rise. Three types of classes of systems have been evolving over the past decade. The first class includes systems capable of searching through documents using keywords. These systems, more commonly known as search engines, such as Google Search and Yahoo Search, apply advanced language processing and probabilistic methods to index words and phrases to enable rapid retrieval of documents. Search engine Digital Object Identifier 10.1109/MSP.2008.918410 IEEE SIGNAL PROCESSING MAGAZINE [18] MAY 2008 1053-5888/08/$25.00©2008IEEEperformance is rather impressive in terms of efficiency and accuracy of retrieval for top-ten candidates. Google is now able to re-index the Web daily and provide search responses in a fraction of a second. Search engines have also been applied for voice search but at a smaller scale. The most commonly used ones today are automated directory assistance