Results 1 -
6 of
6
Word level confidence annotation using combinations of features”, European conference on speech communication and technology
, 2001
"... This paper describes the development of a word-level confidence metric suitable for use in a dialog system. Two aspects of the problems are investigated: the identification of useful features and the selection of an effective classifier. We find that two parse-level features, Parsing-Mode and Slot-B ..."
Abstract
-
Cited by 20 (1 self)
- Add to MetaCart
This paper describes the development of a word-level confidence metric suitable for use in a dialog system. Two aspects of the problems are investigated: the identification of useful features and the selection of an effective classifier. We find that two parse-level features, Parsing-Mode and Slot-Backoff-Mode, provide annotation accuracy comparable to that observed for decoder-level features. However, both decoderlevel and parse-level features independently contribute to confidence annotation accuracy. In comparing different classification techniques, we found that Support Vector Machines (SVMs) appear to provide the best accuracy. Overall we achieve 39.7 % reduction in annotation uncertainty for a binary confidence decision in a travel-planning domain. 1.
University of Colorado Dialog Systems for Travel and Navigation
"... This paper presents recent improvements in the development of the University of Colorado "CU Communicator" and "CUMove " spoken dialog systems. First, we describe the CU Communicator system that integrates speech recognition, synthesis and natural language understanding technologies using the DARPA ..."
Abstract
-
Cited by 7 (1 self)
- Add to MetaCart
This paper presents recent improvements in the development of the University of Colorado "CU Communicator" and "CUMove " spoken dialog systems. First, we describe the CU Communicator system that integrates speech recognition, synthesis and natural language understanding technologies using the DARPA Hub Architecture. Users are able to converse with an automated travel agent over the phone to retrieve up-to-date travel information such as flight schedules, pricing, along with hotel and rental car availability. The CU Communicator has been under development since April of 1999 and represents our test-bed system for developing robust human-computer interactions where reusability and dialogue system portability serve as two main goals of our work. Next, we describe our more recent work on the CU Move dialog system for in-vehicle route planning and guidance. This work is in joint collaboration with HRL and is sponsored as part of the DARPA Communicator program. Specifically, we will provide an overview of the task, describe the data collection environment for in-vehicle systems development, and describe our initial dialog system constructed for route planning.
Out-of-Domain Utterance Detection using Classification Confidences of Multiple Topics
"... Abstract — One significant problem for spoken language systems is how to cope with users ’ OOD (out-of-domain) utterances which cannot be handled by the back-end application system. In this paper, we propose a novel OOD detection framework, which makes use of the classification confidence scores of ..."
Abstract
-
Cited by 4 (0 self)
- Add to MetaCart
Abstract — One significant problem for spoken language systems is how to cope with users ’ OOD (out-of-domain) utterances which cannot be handled by the back-end application system. In this paper, we propose a novel OOD detection framework, which makes use of the classification confidence scores of multiple topics and applies a linear discriminant model to perform in-domain verification. The verification model is trained using a combination of deleted interpolation of the in-domain data and minimumclassification-error training, and does not require actual OOD data during the training process, thus realizing high portability. When applied to the “phrasebook ” system, a single utterance read-style speech task, the proposed approach achieves an absolute reduction in OOD detection errors of up to 8.1 points (40% relative) compared to a baseline method based on the maximum topic classification score. Furthermore, the proposed approach realizes comparable performance to an equivalent system trained on both in-domain and OOD data, while requiring no OOD data during training. We also apply this framework to the “machineaided-dialogue” corpus, a spontaneous dialogue speech task, and extend the framework in two manners. First, we introduce topic clustering which enables reliable topic confidence scores to be generated even for indistinct utterances, and second, we implement methods to effectively incorporate dialogue context. Integration of these two methods into the proposed framework significantly improves OOD detection performance, achieving a further reduction in EER of 7.9 points. I.
Response-Based Confidence Annotation for Spoken Dialogue Systems
"... Spoken and multimodal dialogue systems typically make use of confidence scores to choose among (or reject) a speech recognizer’s N-best hypotheses for a particular utterance. We argue that it is beneficial to instead choose among a list of candidate system responses. We propose a novel method in whi ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
Spoken and multimodal dialogue systems typically make use of confidence scores to choose among (or reject) a speech recognizer’s N-best hypotheses for a particular utterance. We argue that it is beneficial to instead choose among a list of candidate system responses. We propose a novel method in which a confidence score for each response is derived from a classifier trained on acoustic and lexical features emitted by the recognizer, as well as features culled from the generation of the candidate response itself. Our responsebased method yields statistically significant improvements in F-measure over a baseline in which hypotheses are chosen based on recognition confidence scores only. 1
Improving Spoken Dialog Systems
, 2000
"... In this paper the possible sources of inaccuracies of a spoken dialog system are discussed, and the approaches to address each of them are surveyed. ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
In this paper the possible sources of inaccuracies of a spoken dialog system are discussed, and the approaches to address each of them are surveyed.
Word Level Confidence Annotation using Combinations of Features
, 2001
"... This paper describes the development of a word-level confidence metric suitable for use in a dialog system. Two aspects of the problems are investigated: the identification of useful features and the selection of an effective classifier. We find that two parse-level features, Parsing-Mode and SlotBa ..."
Abstract
- Add to MetaCart
This paper describes the development of a word-level confidence metric suitable for use in a dialog system. Two aspects of the problems are investigated: the identification of useful features and the selection of an effective classifier. We find that two parse-level features, Parsing-Mode and SlotBackoff -Mode, provide annotation accuracy comparable to that observed for decoder-level features. However, both decoderlevel and parse-level features independently contribute to confidence annotation accuracy. In comparing different classification techniques, we found that Support Vector Machines (SVMs) appear to provide the best accuracy. Overall we achieve 39.7% reduction in annotation uncertainty for a binary confidence decision in a travel-planning domain.

