• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

DMCA

Towards unsupervised pattern discovery in speech (2008)

Cached

  • Download as a PDF

Download Links

  • [people.csail.mit.edu]
  • [www.sls.csail.mit.edu]
  • [groups.csail.mit.edu]
  • [www.sls.lcs.mit.edu]
  • [groups.csail.mit.edu]
  • [www.sls.csail.mit.edu]
  • [groups.csail.mit.edu]
  • [www.sls.lcs.mit.edu]
  • [groups.csail.mit.edu]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Alex S. Park , James R. Glass
Citations:77 - 10 self
  • Summary
  • Citations
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@MISC{Park08towardsunsupervised,
    author = {Alex S. Park and James R. Glass},
    title = {Towards unsupervised pattern discovery in speech},
    year = {2008}
}

Share

Facebook Twitter Reddit Bibsonomy

OpenURL

 

Abstract

We present a novel approach to speech processing based on the principle of pattern discovery. Our work represents a departure from traditional models of speech recognition, where the end goal is to classify speech into categories defined by a prespecified inventory of lexical units (i.e., phones or words). Instead, we attempt to discover such an inventory in an unsupervised manner by exploiting the structure of repeating patterns within the speech signal. We show how pattern discovery can be used to automatically acquire lexical entities directly from an untranscribed audio stream. Our approach to unsupervised word acquisition utilizes a segmental variant of a widely used dynamic programming technique, which allows us to find matching acoustic patterns between spoken utterances. By aggregating information about these matching patterns across audio streams, we demonstrate how to group similar acoustic sequences together to form clusters corresponding to lexical entities such as words and short multiword phrases. On a corpus of academic lecture material, we demonstrate that clusters found using this technique exhibit high purity and that many of the corresponding lexical identities are relevant to the underlying audio stream.

Keyphrases

towards unsupervised pattern discovery    lexical entity    pattern discovery    traditional model    underlying audio stream    matching acoustic pattern    spoken utterance    prespecified inventory    segmental variant    word acquisition    unsupervised manner    lexical unit    unsupervised pattern discovery    academic lecture material    audio stream    unsupervised word acquisition    untranscribed audio stream    speech recognition    group similar acoustic sequence    dynamic programming technique    novel approach    end goal    short multiword phrase    corresponding lexical identity    speech signal    index term speech processing    technique exhibit high purity   

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University