• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Using dialog-activity similarity for spoken information retrieval. (2013)

by N G Ward, S D Werner
Venue:In Interspeech.
Add To MetaCart

Tools

Sorted by:
Results 1 - 4 of 4

The Similar Segments in Social Speech Task

by Nigel G. Ward, Steven D. Werner, David G. Novick, Elizabeth E. Shriberg, Catharine Oertel, Tatsuya Kawahara , 2013
"... Similar Segments in Social Speech was one of the Brave New Tasks at MediaEval 2013. The task involves finding segments similar to a query segment, in a multimedia collection of informal, unstructured dialogs among members of a small community. Categories and Subject Descriptors ..."
Abstract - Cited by 5 (4 self) - Add to MetaCart
Similar Segments in Social Speech was one of the Brave New Tasks at MediaEval 2013. The task involves finding segments similar to a query segment, in a multimedia collection of informal, unstructured dialogs among members of a small community. Categories and Subject Descriptors
(Show Context)

Citation Context

... resemble undirected search or even recommendation requests more than directed search. Despite the large volume of research in technologies for audio and multimedia search, as surveyed for example by =-=[1, 5]-=-, there has been no research addressing such a scenario, or otherwise search in social multimedia. There is a need for evaluation support, both for examinations of the suitability for this task of exi...

Data collection for the Similar Segments in Social Speech task.

by Nigel G Ward , Steven D Werner , 2013
"... Information retrieval systems rely heavily on models of similarity, but for spoken dialog such models currently use mostly standard textual-content similarity. As part of the MediaEval Benchmarking Initiative, we have created a new corpus to support development of similarity models for spoken dialo ..."
Abstract - Cited by 3 (3 self) - Add to MetaCart
Information retrieval systems rely heavily on models of similarity, but for spoken dialog such models currently use mostly standard textual-content similarity. As part of the MediaEval Benchmarking Initiative, we have created a new corpus to support development of similarity models for spoken dialog. This corpus includes 26 casual dialogs among members of two semi-cohesive groups, totaling about 5 hours, with 1889 labeled regions associated into 227 sets which annotators judged to be similar enough to share a tag. This technical report brings together information about this corpus and its intended uses, previously only available on the project website.

Evaluating Prosody-Based Similarity Models for Information Retrieval

by Steven D. Werner, Nigel G. Ward
"... Prosody is important in spoken language, and especially in dialog, but its utility for search in dialog archives has remained an open question. Using prosody-based measures of similarity, which also roughly correlate with dialog-activity similarity and topic similarity, we built support for “retriev ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
Prosody is important in spoken language, and especially in dialog, but its utility for search in dialog archives has remained an open question. Using prosody-based measures of similarity, which also roughly correlate with dialog-activity similarity and topic similarity, we built support for “retrieve more like this ” searches. Performance on the Similar Segments in Social Speech Task at MediaEval 2013 was well above baseline, showing the value of prosody for search. 1.
(Show Context)

Citation Context

...more narrow-window features close to the point of interest and fewer distant-context features. After PCA this gave 78 dimensions, ordered by how much of the variation they explained. In previous work =-=[4]-=- we found that dialog timepoints which were proximal in this space tended to be similar not only in dialog activity but in topic as well. Here we extend this work to use better similarity models, and ...

A prosody-based vectorspace model of dialog activity for information retrieval

by Nigel G Ward , Steven D Werner , Fernando Garcia , Emilio Sanchis - Speech Communication , 2015
"... Abstract Search in audio archives is a challenging problem. Using prosodic information to help find relevant content has been proposed as a complement to word-based retrieval, but its utility has been an open question. We propose a new way to use prosodic information in search, based on a vector-sp ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Abstract Search in audio archives is a challenging problem. Using prosodic information to help find relevant content has been proposed as a complement to word-based retrieval, but its utility has been an open question. We propose a new way to use prosodic information in search, based on a vector-space model, where each point in time maps to a point in a vector space whose dimensions are derived from numerous prosodic features of the local context. Point pairs that are close in this vector space are frequently similar, not only in terms of the dialog activities, but also in topic. Using proximity in this space as an indicator of similarity, we built support for a query-by-example function. Searchers were happy to use this function, and it provided value on a large testset. Prosody-based retrieval did not perform as well as word-based retrieval, but the two sources of information were often non-redundant and in combination they sometimes performed better than either separately.
(Show Context)

Citation Context

...ties. 2. Background: Prosody for Search in Speech Most current spoken dialogue retrieval systems are based on the view that speech is essentially just noise-corrupted text (Chelba et al., 2008). They use speech recognition techniques to infer the words said, and then use text-based search techniques on the resulting transcript. However the performance of such systems is generally weak, and today audio 1specifically two conference proceedings, three workshop papers, and two technical reports: The idea of using a vector-space model of prosodic context for information retrieval was presented in (Ward and Werner, 2013b), the qualitative analysis of similarity in this space was presented partly there and partly in (Ward et al., submitted), the initial user study was reported in (Ward and Werner, 2013b), the need for a corpus of social speech was explained in (Ward and Werner, 2012) and described partly in (Ward et al., 2013) and (Ward and Werner, 2013a), the comparison of the training schemes and distance metrics was reported in (Werner and Ward, 2013), and the comparison to and combination with lexical measures of similarity was reported in part in (Garcia et al., 2013) and in (Ward et al., submitted). 2 s...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University