Results 1 -
4 of
4
Unsupervised Spoken Keyword Spotting via Segmental DTW on Gaussian Posteriorgrams
"... Abstract—In this paper, we present an unsupervised learning framework to address the problem of detecting spoken ..."
Abstract
-
Cited by 7 (4 self)
- Add to MetaCart
Abstract—In this paper, we present an unsupervised learning framework to address the problem of detecting spoken
AN INNER-PRODUCT LOWER-BOUND ESTIMATE FOR DYNAMIC TIME WARPING
"... In this paper, we present a lower-bound estimate for dynamic time warping (DTW) on time series consisting of multi-dimensional posterior probability vectors known as posteriorgrams. We develop a lower-bound estimate based on the inner-product distance that has been found to be an effective metric fo ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
In this paper, we present a lower-bound estimate for dynamic time warping (DTW) on time series consisting of multi-dimensional posterior probability vectors known as posteriorgrams. We develop a lower-bound estimate based on the inner-product distance that has been found to be an effective metric for computing similarities between posteriorgrams. In addition to deriving the lower-bound estimate, we show how it can be efficiently used in an admissible K nearest neighbor (KNN) search for spotting matching sequences. We quantify the amount of computational savings achieved by performing a set of unsupervised spoken keyword spotting experiments using Gaussian mixture model posteriorgrams. In these experiments the proposed lower-bound estimate eliminates 89 % of the DTW previously required calculations without affecting overall keyword detection performance. Index Terms — dynamic time warping, posteriorgram 1.
Association (2011)" Zero-resource
, 2011
"... audio-only spoken term detection based on a combination of template matching techniques ..."
Abstract
- Add to MetaCart
audio-only spoken term detection based on a combination of template matching techniques

