Fitting a mixture model by expectation maximization to discover motifs in biopolymers. (1994)

by Timothy L Bailey, Charles Elkan
Venue: Proc Int Conf Intell Syst Mol Biol
Citations: 945 (5 self)

BibTeX

@INPROCEEDINGS{Bailey94fittinga,
    author = {Timothy L Bailey and Charles Elkan},
    title = {Fitting a mixture model by expectation maximization to discover motifs in biopolymers.},
    booktitle = {Proc Int Conf Intell Syst Mol Biol},
    year = {1994},
    pages = {28--36}
}


Abstract

The algorithm described in this paper discovers one or more motifs in a collection of DNA or protein sequences by using the technique of expectation maximization to fit a two-component finite mixture model to the set of sequences. Multiple motifs are found by fitting a mixture model to the data, probabilistically erasing the occurrences of the motif thus found, and repeating the process to find successive motifs. The algorithm requires only a set of unaligned sequences and a number specifying the width of the motifs as input. It returns a model of each motif and a threshold which together can be used as a Bayes-optimal classifier for searching for occurrences of the motif in other databases. The algorithm estimates how many times each motif occurs in each sequence in the dataset and outputs an alignment of the occurrences of the motif. The algorithm is capable of discovering several different motifs with differing numbers of occurrences in a single dataset.
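
To make the central idea concrete, here is a minimal Python sketch of fitting a two-component mixture by expectation maximization for a single motif of fixed width. It is an illustration under stated assumptions, not the authors' implementation: the function names (`encode_windows`, `fit_motif`, `consensus`), the DNA-only alphabet, the seed-based initialization, the pseudocount, and the treatment of overlapping windows as independent samples are all choices made for this example. The probabilistic erasing step used to find successive motifs is omitted.

```python
import numpy as np

ALPHABET = "ACGT"  # assumption: DNA only; a protein alphabet would work the same way
INDEX = {c: i for i, c in enumerate(ALPHABET)}

def encode_windows(seqs, w):
    """Return every overlapping width-w window as a row of letter indices."""
    rows = [[INDEX[c] for c in s[i:i + w]]
            for s in seqs for i in range(len(s) - w + 1)]
    return np.array(rows)

def fit_motif(seqs, w, seed, n_iter=50, pseudo=0.1):
    """Fit a motif/background mixture to all width-w windows by EM.

    `seed` is a hypothetical starting guess of length w; how to choose
    starting points is not covered by this sketch.
    """
    assert len(seed) == w
    k = len(ALPHABET)
    X = encode_windows(seqs, w)          # shape (n, w)
    onehot = np.eye(k)[X]                # shape (n, w, k)

    # Motif component: one letter distribution per position, biased to the seed.
    motif = np.ones((w, k))
    motif[np.arange(w), [INDEX[c] for c in seed]] += 6.0
    motif /= motif.sum(axis=1, keepdims=True)

    # Background component: a single letter distribution for all positions.
    background = onehot.sum(axis=(0, 1)) + pseudo
    background /= background.sum()

    # Mixing weight: initial guess of one motif occurrence per sequence.
    lam = len(seqs) / len(X)

    for _ in range(n_iter):
        # E-step: posterior probability that each window came from the motif.
        log_pm = np.log(lam) + np.log(motif)[np.arange(w), X].sum(axis=1)
        log_pb = np.log(1.0 - lam) + np.log(background)[X].sum(axis=1)
        z = 1.0 / (1.0 + np.exp(log_pb - log_pm))

        # M-step: re-estimate parameters from the weighted letter counts.
        counts = np.einsum("i,ijk->jk", z, onehot) + pseudo
        motif = counts / counts.sum(axis=1, keepdims=True)
        bg_counts = np.einsum("i,ijk->k", 1.0 - z, onehot) + pseudo
        background = bg_counts / bg_counts.sum()
        lam = float(np.clip(z.mean(), 1e-6, 1.0 - 1e-6))

    return motif, background, lam, z

def consensus(motif):
    """Most probable letter at each motif position."""
    return "".join(ALPHABET[i] for i in motif.argmax(axis=1))
```

Calling `fit_motif(seqs, w, seed)` on a list of unaligned DNA strings returns the position-specific motif model, the background distribution, the mixing weight `lam`, and the per-window posteriors `z`. Under this model, labelling a window as a motif occurrence when its posterior exceeds 0.5 is equivalent to thresholding its motif-versus-background log-odds score at `log((1 - lam) / lam)`, which is one way to read the abstract's pairing of a motif model with a threshold.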
