Towards empirical dialog-state modeling and its use in language modeling (2012)

by N G Ward, A Vega
Venue: Interspeech
Results 1 - 6 of 6

USING DIALOG-ACTIVITY SIMILARITY FOR SPOKEN INFORMATION RETRIEVAL

by Nigel G. Ward, Steven D. Werner
Cited by 4 (4 self)
We want to enable users to locate desired information in spoken audio documents using not only the words, but also dialog activities. Following previous research, we infer this information from prosodic features; however, instead of retrieving by matching to a predefined finite set of activities, we estimate similarity using a vector space representation. Utterances close in this vector space are frequently similar not only pragmatically, but also topically. Using this, we implemented a dialog-based query-by-example function and built it into an interface for use in combination with normal lexical search. Evaluating its utility in an experiment with four searchers doing twenty tasks each, we found that searchers used the new feature and considered it helpful, but only for some search tasks.

Citation Context

...ctivity information. These features were computed every 10 milliseconds throughout the corpus. After PCA this gave 76 dimensions, ordered by how much of the variation they explained. Upon examination [14, 15], it turned out that most of the top dimensions aligned with various aspects of dialog. These aspects were diverse, including dialog situations, transient dialog states, cooperative dialog acts, simpl...

Where in Dialog Space does Uh-huh Occur?

by Nigel G. Ward, David G. Novick, Alejandro Vega
Cited by 1 (1 self)
In what dialog situations and contexts do backchannels commonly occur? This paper examines this question using a newly developed notion of dialog space, defined by orthogonal, prosody-derived dimensions. Taking 3363 instances of uh-huh, found in the Switchboard corpus, we examine where in this space they tend to occur. While the results largely agree with previous descriptions and observations, we find several novel aspects, relating to rhythm, polarity, and the details of the low-pitch cue.
Index Terms: backchannels, feedback, prosody, context, principal component analysis, dimensions, dialog activities

Citation Context

...e corpus. We then applied Principal Component Analysis to these values. This gave a list of 76 dimensions, ordered by how much of the variation in the prosodic features they explain. Upon examination [7, 8], most of the top dimensions turned out to align with aspects of dialog. These aspects were diverse, including dialog situations, transient dialog states, cooperative dialog acts, simpler dialog actio...

Detecting Differences in Communication During Two Types of Patient Handovers: A Linguistic Construct Categorization Approach

by Zachary Woods, Brian Hilligoss, Andrew Duchon (Aptima Inc), Nicholas Beecroft, Emily S. Patterson
Patient handovers are a critical point in the patient care process. Software to identify differences in communication content and strategies across different types of patient handovers could be helpful in customizing physician training programs. To determine whether there were differences, Linguistic Inquiry and Word Count (LIWC) software was used. The primary measure was the LIWC output score, which is the frequency of mention of words in a construct category divided by the total number of words in the handover transcript. Two types of constructs were investigated: 1) content, which included name/age, care plan, prognosis, and family, and 2) strategy, which included questioning and collaborative cross-checks. We hypothesized that the Emergency Department (ED) to hospital transfer compared to Intensive Care Unit (ICU) sign-outs would have more discussion of family and less of the patient’s prognosis, as well as more collaborative cross-checks. A two-tailed t-test was used to detect differences. One hypothesis was confirmed, that there was less discussion of prognosis in the ED as compared to the ICU handover. Unexpected findings were less discussion of the care plan and more questioning in the ED as compared to the ICU handover. Findings confirm that both communication content and strategies are different for the two types of patient handovers and that an automated analysis approach can detect differences across a set of handover transcripts.
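The score defined in the abstract above (words in a construct category divided by total words in the transcript) can be illustrated with a minimal sketch. This is not the LIWC software itself: the function name `category_score`, the simple regex tokenizer, and the tiny `FAMILY` word list are all assumptions made for illustration, and the real LIWC dictionaries and tokenization rules differ.

```python
import re


def category_score(transcript, category_words):
    """Return the fraction of tokens in `transcript` found in `category_words`.

    Mirrors the ratio described in the abstract: category word count
    divided by total word count. Tokenization here is a simplification.
    """
    tokens = re.findall(r"[a-z']+", transcript.lower())
    if not tokens:
        return 0.0  # avoid division by zero on an empty transcript
    hits = sum(1 for token in tokens if token in category_words)
    return hits / len(tokens)


# Hypothetical word list for a "family" construct; NOT the LIWC dictionary.
FAMILY = {"family", "wife", "husband", "daughter", "son"}

# Made-up handover fragment, used only to exercise the function.
example = "The family is at bedside and the daughter has questions"
score = category_score(example, FAMILY)
print(f"family score: {score:.2f}")
```

Per-transcript scores computed this way for the two handover types could then be compared with a two-tailed t-test, as the abstract describes; that step is not shown here.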

Sub-lexical Dialogue Act Classification in a Spoken Dialogue System Support for the Elderly with Cognitive Disabilities

by Ken Sadohara, Hiroaki Kojima, Takuya Narita, Misato Nihei, Minoru Kamata, Shinichi Onaka, Yoshihiro Fujita, Takenobu Inoue
This paper presents a dialogue act classification for a spoken dialogue system that delivers necessary information to elderly subjects with mild dementia. Lexical features have been shown to be effective for classification, but the automatic transcription of spontaneous speech demands expensive language modeling. Therefore, this paper proposes a classifier that does not require language modeling and that uses sub-lexical features instead of lexical features. This classifier operates on sequences of phonemes obtained by a phoneme recognizer and exhaustively analyzes the saliency of all possible sub-sequences using a support vector machine with a string kernel. An empirical study of a dialogue corpus containing elderly speech showed that the sub-lexical classifier was robust against the poor modeling of language and it performed better than a lexical classifier that used hidden Markov models of words.
Index Terms: dialogue acts, support vector machines, string kernels, spontaneous speech, elderly speech, dementia

Citation Context

...ation of DAs The automatic classification of DAs comprises two important components: features and modeling methods. The features investigated previously used various types of knowledge, e.g., lexical [21, 1, 2, 3, 22, 4, 23], syntactic [22, 24], prosodic [13, 1, 3, 22, 23], and discourse structural [25, 1]. In this study, sublexical features, i.e., sequences of phonemes, were considered together with the DA of the preced...
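
The idea of comparing phoneme sequences through their shared sub-sequences, as described in the abstract of this entry, can be sketched with a simple string kernel. The abstract does not say which kernel the authors used, so this is an assumed variant: it counts contiguous sub-sequences up to a maximum length, and the names `substring_counts`, `substring_kernel`, and `max_len` are hypothetical.

```python
from collections import Counter


def substring_counts(phonemes, max_len):
    """Count every contiguous sub-sequence of length 1..max_len."""
    counts = Counter()
    n = len(phonemes)
    for start in range(n):
        for length in range(1, max_len + 1):
            if start + length > n:
                break
            counts[tuple(phonemes[start:start + length])] += 1
    return counts


def substring_kernel(a, b, max_len=3):
    """Inner product of the sub-sequence count vectors of `a` and `b`.

    An assumed, simplified kernel for illustration; the paper's kernel
    may weight or select sub-sequences differently.
    """
    counts_a = substring_counts(a, max_len)
    counts_b = substring_counts(b, max_len)
    return sum(v * counts_b[s] for s, v in counts_a.items() if s in counts_b)


# Made-up phoneme sequences, used only to exercise the kernel.
seq_a = ["h", "a", "i"]
seq_b = ["h", "a"]
print(substring_kernel(seq_a, seq_b))
```

A matrix of such kernel values between training utterances could be passed to a support vector machine that accepts precomputed kernels; that training step is omitted here.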

APPROVED:

by Alejandro Vega, David Novick Ph. D, Jon Amastae Ph. D, Benjamin C. Flores, Ph. D
I would like to give thanks to all my family and friends. Without their support all these years, I would have never gotten to where I am right now. I want to thank my family for their sacrifices and patience. I want to thank my friends for their support and company during the good and tough times. I want to give my deepest thanks to Nigel Ward for all his help as an advisor and mentor. I thank David Novick for being a mentor and teacher and for all the helpful feedback all these years. I thank Jon Amastae for all the helpful comments and being my committee member. I would like to give deep thanks to Shreyas Karkhedkar for his help with the Respond features, for his help being someone I could always talk to when I had trouble, and his help as one of my best friends during this process. This work was supported in part by NSF Award IIS-0914868. Previous studies show that immediate and long range prosodic context provide beneficial information when applied to a language model.

Citation Context

... in an ASR system, achieved a significant 1.0% reduction in WER (0.4% absolute) when applied to a German dialog corpus. These promising results using only immediate prosodic context led Ward and Vega [12] to use more prosodic features to potentially increase the information given to the LM. Within a six-second window centered at word onset, a feature set including both speaker and interlocutor volume,...

USING EMOTION AS INFERRED FROM PROSODY IN LANGUAGE MODELING

by Shreyas Ashok Karkhedkar, David Novick Ph. D, Stephen Crites, Benjamin C. Flores, Ph. D
Abstract not found