• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Query-by-example spoken term detection using phonetic posteriorgram templates (2009)

by Timothy J Hazen, Wade Shen, Christopher White
Add To MetaCart

Tools

Sorted by:
Results 1 - 4 of 4

Unsupervised Spoken Keyword Spotting via Segmental DTW on Gaussian Posteriorgrams

by Yaodong Zhang, James R. Glass
"... Abstract—In this paper, we present an unsupervised learning framework to address the problem of detecting spoken ..."
Abstract - Cited by 7 (4 self) - Add to MetaCart
Abstract—In this paper, we present an unsupervised learning framework to address the problem of detecting spoken

AN INNER-PRODUCT LOWER-BOUND ESTIMATE FOR DYNAMIC TIME WARPING

by Yaodong Zhang, James R. Glass
"... In this paper, we present a lower-bound estimate for dynamic time warping (DTW) on time series consisting of multi-dimensional posterior probability vectors known as posteriorgrams. We develop a lower-bound estimate based on the inner-product distance that has been found to be an effective metric fo ..."
Abstract - Cited by 2 (1 self) - Add to MetaCart
In this paper, we present a lower-bound estimate for dynamic time warping (DTW) on time series consisting of multi-dimensional posterior probability vectors known as posteriorgrams. We develop a lower-bound estimate based on the inner-product distance that has been found to be an effective metric for computing similarities between posteriorgrams. In addition to deriving the lower-bound estimate, we show how it can be efficiently used in an admissible K nearest neighbor (KNN) search for spotting matching sequences. We quantify the amount of computational savings achieved by performing a set of unsupervised spoken keyword spotting experiments using Gaussian mixture model posteriorgrams. In these experiments the proposed lower-bound estimate eliminates 89 % of the DTW previously required calculations without affecting overall keyword detection performance. Index Terms — dynamic time warping, posteriorgram 1.

Association (2011)" Zero-resource

by O Muscariello, Guillaume Gravier, Frédéric Bimbot , 2011
"... audio-only spoken term detection based on a combination of template matching techniques ..."
Abstract - Add to MetaCart
audio-only spoken term detection based on a combination of template matching techniques

Employing Subsequence Matching in Audio Data Processing

by Petr Volný, David Novák, Pavel Zezula , 2011
"... ..."
Abstract - Add to MetaCart
Abstract not found
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University