• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure (2002)

by S Eddy
Venue:BMC Bioinformatics
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 31
Next 10 →

Rfam: An RNA family database

by Sam Griffiths-Jones, Alex Bateman, Mhairi Marshall, Ajay Khanna, Sean R. Eddy, Sean R - Nucleic Acids Res , 2003
"... Rfam is a collection of multiple sequence alignments and covariance models representing non-coding RNA families. Rfam is available on the web in the UK at http://www.sanger.ac.uk/Software/Rfam/ and in the US at http://rfam.wustl.edu/. These websites allow the user to search a query sequence against ..."
Abstract - Cited by 114 (1 self) - Add to MetaCart
Rfam is a collection of multiple sequence alignments and covariance models representing non-coding RNA families. Rfam is available on the web in the UK at http://www.sanger.ac.uk/Software/Rfam/ and in the US at http://rfam.wustl.edu/. These websites allow the user to search a query sequence against a library of covariance models, and view multiple sequence alignments and family annotation. The database can also be downloaded in flatfile form and searched locally using the INFERNAL package (http://infernal.wustl.edu/). The first release of Rfam (1.0) contains 25 families, which annotate over 50000 non-coding RNA genes in the taxonomic divisions of the EMBL nucleotide database.

RSEARCH: Finding homologs of single structured RNA sequences

by Robert J. Klein, Sean R. Eddy - BMC Bioinformatics , 2003
"... Background: Many trans-acting noncoding RNA genes and cis-acting RNA regulatory elements conserve secondary structure rather than primary sequence. Most homology search tools only look at the primary sequence level, however. ..."
Abstract - Cited by 83 (0 self) - Add to MetaCart
Background: Many trans-acting noncoding RNA genes and cis-acting RNA regulatory elements conserve secondary structure rather than primary sequence. Most homology search tools only look at the primary sequence level, however.

Faster Genome Annotation of Non-coding RNA Families without Loss of Accuracy

by Zasha Weinberg, Walter L. Ruzzo - In Proceedings of the Eighth Annual International Conference on Computational Molecular Biology (RECOMB , 2004
"... RNA molecules that do not code for proteins. Covariance Models (CMs) are a useful statistical tool to find new members of an ncRNA gene family in a large genome database, using both sequence and, importantly, RNA secondary structure information. Unfortunately, CM searches are slow. This paper shows ..."
Abstract - Cited by 23 (7 self) - Add to MetaCart
RNA molecules that do not code for proteins. Covariance Models (CMs) are a useful statistical tool to find new members of an ncRNA gene family in a large genome database, using both sequence and, importantly, RNA secondary structure information. Unfortunately, CM searches are slow. This paper shows how to make CMs faster while provably sacrificing none of their accuracy. Specifically, based on the CM, our software builds a profile hidden Markov model (HMM), which filters the genome database. This HMM is a rigorous filter, i.e., its filtering eliminates only sequences that provably could not be annotated as homologs. The CM is run only on what remains. Optimizing the HMM for filtering involves minimizing an exponential objective # Dept. of Computer Science & Engineering, University of Washington, Box 352350, Seattle, WA, USA, 98195, zasha@cs.washington.edu + Depts. of Computer Science & Engineering and Genome Sciences, University of Washington, Box 352350, Seattle, WA, USA, 98195, ruzzo@cs.washington.edu c ACM, 2004. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version will be published in Proc. Eighth Annual Inter. Conf. on Computational Molecular Biology (RECOMB) , 2004. See http://recomb04.sdsc.edu/.

Human microRNA prediction through a probabilistic co-learning model of sequence and structure

by Jin-wu Nam, Ki-roo Shin, Jinju Han, Yoontae Lee, V. Narry Kim, Byoung-tak Zhang - Nucleic Acids Res , 2005
"... and structure ..."
Abstract - Cited by 19 (1 self) - Add to MetaCart
and structure

Evolutionary Patterns of Non-Coding RNAs

by Athanasius F. Bompfünewerer , Christoph Flamm , Claudia Fried , Guido Fritzsch , Ivo L. Hofacker , Jörg Lehmann , Kristin Missal , Axel Mosig , Bettina Müller , Sonja J. Prohaska , Bärbel M. R. Stadler , Peter F. Stadler , Andrea Tanzer , Stefan Washietl , Christina Witwer , 2005
"... A plethora of new functions of non-coding RNAs have been discovered in past few years. In fact, RNA is emerging as the central player in cellular regulation, taking on active roles in multiple regulatory layers from transcription, RNA maturation, and RNA modification to translational regulation. Ne ..."
Abstract - Cited by 12 (4 self) - Add to MetaCart
A plethora of new functions of non-coding RNAs have been discovered in past few years. In fact, RNA is emerging as the central player in cellular regulation, taking on active roles in multiple regulatory layers from transcription, RNA maturation, and RNA modification to translational regulation. Nevertheless, very little is known about the evolution of this “Modern RNA World ” and its components. In this contribution we attempt to provide at least a cursory overview of the diversity of non-coding RNAs and functional RNA motifs in non-translated regions of regular messenger RNAs (mRNAs) with an emphasis on evolutionary questions. This survey is complemented by an in-depth analysis of examples from different classes of RNAs focusing mostly on their evolution in the vertebrate lineage. We present a survey of Y RNA genes in vertebrates, studies of the molecular evolution of the U7 snRNA, the snoRNAs E1/U17, E2, and E3, the Y RNA family, the let-7 microRNA family, and the mRNA-like evf-1 gene. We furthermore discuss the statistical distribution

Fast pairwise structural RNA alignments by pruning of the dynamical programming matrix. PLoS Comput Biol 3

by Jakob H. Havgaard, Elfar Torarinsson, Jan Gorodkin , 2007
"... It has become clear that noncoding RNAs (ncRNA) play important roles in cells, and emerging studies indicate that there might be a large number of unknown ncRNAs in mammalian genomes. There exist computational methods that can be used to search for ncRNAs by comparing sequences from different genome ..."
Abstract - Cited by 10 (4 self) - Add to MetaCart
It has become clear that noncoding RNAs (ncRNA) play important roles in cells, and emerging studies indicate that there might be a large number of unknown ncRNAs in mammalian genomes. There exist computational methods that can be used to search for ncRNAs by comparing sequences from different genomes. One main problem with these methods is their computational complexity, and heuristics are therefore employed. Two heuristics are currently very popular: pre-folding and pre-aligning. However, these heuristics are not ideal, as pre-aligning is dependent on sequence similarity that may not be present and pre-folding ignores the comparative information. Here, pruning of the dynamical programming matrix is presented as an alternative novel heuristic constraint. All subalignments that do not exceed a length-dependent minimum score are discarded as the matrix is filled out, thus giving the advantage of providing the constraints dynamically. This has been included in a new implementation of the FOLDALIGN algorithm for pairwise local or global structural alignment of RNA sequences. It is shown that time and memory requirements are dramatically lowered while overall performance is maintained. Furthermore, a new divide and conquer method is introduced to limit the memory requirement during global alignment and backtrack of local alignment. All branch points in the computed RNA structure are found and used to divide the structure into smaller unbranched segments. Each segment is then realigned and backtracked in a normal fashion. Finally, the FOLDALIGN algorithm has also been updated with a better memory implementation and an improved energy model. With these improvements in the algorithm, the FOLDALIGN software package provides the molecular biologist with an efficient and user-friendly tool for searching for new ncRNAs. The software package is available for download at

Principles and Limitations of Computational MicroRNA Gene and Target Finding

by Morten Lindow, Jan Gorodkin , 2007
"... In 2001 there were four PubMed entries matching the word ‘‘microRNA’’ (miRNA). Interestingly, this number has now far exceeded 1300 and is still rapidly increasing. This more than anything demonstrates the extreme attention this field has had within a short period of time. With the large amounts of ..."
Abstract - Cited by 5 (2 self) - Add to MetaCart
In 2001 there were four PubMed entries matching the word ‘‘microRNA’’ (miRNA). Interestingly, this number has now far exceeded 1300 and is still rapidly increasing. This more than anything demonstrates the extreme attention this field has had within a short period of time. With the large amounts of sequence data being generated, the need for analysis by computational approaches is obvious. Here, we review the general principles used in computational gene and target finding, and discuss the strengths and weaknesses of the methods. Several methods rely on detection of evolutionary conserved candidates, but recent methods have challenged this paradigm by simultaneously searching for the gene and the corresponding target(s). Whereas the early methods made predictions based on sets of hand-derived rules from precursor-miRNA structure or observed target–miRNA interactions, recent methods apply machine learning techniques. Even though these methods are already powerful, the amount of data they rely on is still limited. Since it is evident that data are continuously being generated, it must be anticipated that these methods will further improve their performance.

RALEE–RNA ALignment Editor in Emacs

by Sam Griffiths-jones - Bioinformatics , 2005
"... Summary: Production of high quality multiple sequence alignments of structured RNAs relies on an iterative combination of manual editing and structure prediction. An essential feature of an RNA alignment editor is the facility to mark-up the alignment based on how it matches a given secondary struct ..."
Abstract - Cited by 4 (0 self) - Add to MetaCart
Summary: Production of high quality multiple sequence alignments of structured RNAs relies on an iterative combination of manual editing and structure prediction. An essential feature of an RNA alignment editor is the facility to mark-up the alignment based on how it matches a given secondary structure prediction, but few available alignment editors offer such a feature. The RALEE (RNA ALignment Editor in Emacs) tool provides a simple environment for RNA multiple sequence alignment editing, including structure-specific colour schemes, utilizing helper applications for structure prediction and many more conventional editing functions. This is accomplished by extending the commonly used text editor, Emacs, which is available for Linux, most UNIX systems, Windows and

The tmRDB and SRPDB resources

by Ebbe Sloth Andersen, Magnus Alm Rosenblad, Niels Larsen, Jesper Cairo Westergaard, Jody Burks, Iwona K. Wower, Jacek Wower, Jan Gorodkin, Tore Samuelsson, Christian Zwieb , 2005
"... ..."
Abstract - Cited by 3 (0 self) - Add to MetaCart
Abstract not found

Accurate Multiple Sequence-Structure Alignment of RNA Sequences Using Combinatorial Optimization

by Markus Bauer, Gunnar W. Klau, Knut Reinert , 2007
"... Background: The discovery of functional non-coding RNA sequences has led to an increasing interest in algorithms related to RNA analysis. Traditional sequence alignment algorithms, however, fail at computing reliable alignments of low-homology RNA sequences. The spatial conformation of RNA sequences ..."
Abstract - Cited by 3 (0 self) - Add to MetaCart
Background: The discovery of functional non-coding RNA sequences has led to an increasing interest in algorithms related to RNA analysis. Traditional sequence alignment algorithms, however, fail at computing reliable alignments of low-homology RNA sequences. The spatial conformation of RNA sequences largely determines their function, and therefore RNA alignment algorithms have to take structural information into account. Results: We present a graph-based representation for sequence-structure alignments, which we model as an integer linear program (ILP). We sketch how we compute an optimal or near-optimal solution to the ILP using methods from combinatorial optimization, and present results on a recently published benchmark set for RNA alignments. Conclusions: The implementation of our algorithm yields better alignments in terms of two published scores than the other programs that we tested: This is especially the case with an increasing number of input
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University