Results 1 - 10
of
24
BAliBASE 3.0: latest developments of the multiple sequence alignment benchmark
- Proteins
, 2005
"... ABSTRACT Multiple sequence alignment is one of the cornerstones of modern molecular biology. It is used to identify conserved motifs, to determine protein domains, in 2D/3D structure prediction by homology and in evolutionary studies. Recently, high-throughput technologies such as genome sequencing ..."
Abstract
-
Cited by 48 (1 self)
- Add to MetaCart
ABSTRACT Multiple sequence alignment is one of the cornerstones of modern molecular biology. It is used to identify conserved motifs, to determine protein domains, in 2D/3D structure prediction by homology and in evolutionary studies. Recently, high-throughput technologies such as genome sequencing and structural proteomics have lead to an explosion in the amount of sequence and structure information available. In response, several new multiple alignment methods have been developed that improve both the efficiency and the quality of protein alignments. Consequently, the benchmarks used to evaluate and compare these methods must also evolve. We present here the latest release of the most widely used multiple alignment benchmark, BAliBASE, which provides high quality, manually refined, reference alignments based on 3D structural superpositions. Version 3.0 of BAliBASE includes new, more challenging test cases, representing the real problems encountered when aligning large sets of complex sequences. Using a novel, semiautomatic update protocol, the number of protein families in the benchmark has been increased and representative test cases are now available that cover most of the protein fold space. The total number of proteins in BAliBASE has also been significantly increased from 1444 to 6255 sequences. In addition, full-length sequences are now provided for all test cases, which represent difficult cases for both global and local alignment programs. Finally, the BAliBASE Web site
PANDORA: keyword-based analysis of protein sets by integration of annotation sources
- Nucleic Acids Res
, 2003
"... Recent advances in high-throughput methods and the application of computational tools for automatic classi®cation of proteins have made it possible to carry out large-scale proteomic analyses. Biological analysis and interpretation of sets of proteins is a time-consuming undertaking carried out manu ..."
Abstract
-
Cited by 9 (5 self)
- Add to MetaCart
Recent advances in high-throughput methods and the application of computational tools for automatic classi®cation of proteins have made it possible to carry out large-scale proteomic analyses. Biological analysis and interpretation of sets of proteins is a time-consuming undertaking carried out manually by experts. We have developed PANDORA (Protein ANnotation Diagram ORiented Analysis), a webbased tool that provides an automatic representation of the biological knowledge associated with any set of proteins. PANDORA uses a unique approach of keyword-based graphical analysis that focuses on detecting subsets of proteins that share unique biological properties and the intersections of such sets. PANDORA currently supports
Sequence comparison by sequence harmony identifies subtype specific functional sites
- Nucleic Acids Res
, 2006
"... identifies subtype-specific functional sites ..."
HHsenser: exhaustive transitive profile search using HMM-HMM comparison
- Nucleic Acids Res
, 2006
"... doi:10.1093/nar/gkl195 ..."
DR: MINER: software for phylogenetic motif identification
- Nucleic Acids Res
"... MINER is web-based software for phylogenetic motif (PM) identification. PMs are sequence regions (fragments) that conserve the overall familial phylogeny. PMs have been shown to correspond to a wide variety of catalytic regions, substrate-binding sites and protein interfaces, making them ideal funct ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
MINER is web-based software for phylogenetic motif (PM) identification. PMs are sequence regions (fragments) that conserve the overall familial phylogeny. PMs have been shown to correspond to a wide variety of catalytic regions, substrate-binding sites and protein interfaces, making them ideal functional site predictions. The MINER output provides an intuitive interface for interactive PM sequence analysis and structural visualization. The web implementation of MINER is freely available at
Fast dynamics perturbation analysis for prediction of protein functional sites
"... Background. We present a fast version of the dynamics perturbation analysis (DPA)
algorithm to predict functional sites in protein structures. The original DPA algorithm
finds regions in proteins where interactions cause a large change in the protein
conformational distribution, as measured using th ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
Background. We present a fast version of the dynamics perturbation analysis (DPA)
algorithm to predict functional sites in protein structures. The original DPA algorithm
finds regions in proteins where interactions cause a large change in the protein
conformational distribution, as measured using the relative entropy Dx. Such regions are
associated with functional sites.
Results. The Fast DPA algorithm, which accelerates DPA calculations, is motivated by
an empirical observation that Dx in a normal-modes model is highly correlated with an
entropic term that only depends on the eigenvalues of the normal modes. The eigenvalues
are accurately estimated using first-order perturbation theory, resulting in a N-fold
reduction in the overall computational requirements of the algorithm, where N is the
number of residues in the protein. The performance of the original and Fast DPA
algorithms was compared using protein structures from a standard small-molecule
docking test set. For nominal implementations of each algorithm, top-ranked Fast DPA
predictions overlapped the true binding site 94% of the time, compared to 87% of the
time for original DPA. In addition, per-protein recall statistics (fraction of binding-site
residues that are among predicted residues) were slightly better for Fast DPA. On the
other hand, per-protein precision statistics (fraction of predicted residues that are among
binding-site residues) were slightly better using original DPA. Overall, the performance
of Fast DPA in predicting ligand-binding-site residues was comparable to that of the
original DPA algorithm.
Conclusions. Compared to the original DPA algorithm, the decreased run time with
comparable performance makes Fast DPA well-suited for implementation on a web
server and for high-throughput analysis.
TreeDet: a web server to explore sequence space
, 2006
"... The TreeDet (Tree Determinant) Server is the first release of a system designed to integrate results from methods that predict functional sites in protein families. These methods take into account the relation between sequence conservation and evolutionary importance. TreeDet fully analyses the spac ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
The TreeDet (Tree Determinant) Server is the first release of a system designed to integrate results from methods that predict functional sites in protein families. These methods take into account the relation between sequence conservation and evolutionary importance. TreeDet fully analyses the space of protein sequences in either user-uploaded or automatically generated multiple sequence alignments. The methods implemented in the server represent three main classes of methods for the detection of family-dependent conserved positions, a tree-based method, a correlation based method and a method that employs a principal component analyses coupled to a cluster algorithm. An additional method is provided to highlight the reliability of the position in the alignments. The server is available at
D: Improved phylogenetic motif detection using parsimony
- Proceedings of the IEEE BIBE Meeting 2005
"... We have recently demonstrated (La et al, Proteins, 58:2005) that sequence fragments approximating the overall familial phylogeny, called phylogenetic motifs (PMs), represent a promising protein functional site prediction strategy. Previous results across a structurally and functionally diverse datas ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
We have recently demonstrated (La et al, Proteins, 58:2005) that sequence fragments approximating the overall familial phylogeny, called phylogenetic motifs (PMs), represent a promising protein functional site prediction strategy. Previous results across a structurally and functionally diverse dataset indicate that phylogenetic motifs correspond to a wide variety of known functional characteristics. Phylogenetic motifs are detected using a sliding window algorithm that compares neighbor joining trees on the complete alignment to those on the sequence fragments. In this investigation we identify PMs using heuristic maximum parsimony trees. We show that when using parsimony the functional site prediction accuracy of PMs improves substantially, particularly on divergent datasets. We also show that the new PMs found using parsimony are not necessarily conserved in sequence, and, therefore, would not be detected by traditional motif (information content-based) approaches. 1.
BMC Bioinformatics BioMed Central Research article
, 2007
"... Background frequencies for residue variability estimates: BLOSUM ..."
unknown title
, 2008
"... This Provisional PDF corresponds to the article as it appeared upon acceptance. Fully formatted PDF and full text (HTML) versions will be made available soon. Detection of protein catalytic residues at high precision using local network properties BMC Bioinformatics 2008, 9:517 doi:10.1186/1471-2105 ..."
Abstract
- Add to MetaCart
This Provisional PDF corresponds to the article as it appeared upon acceptance. Fully formatted PDF and full text (HTML) versions will be made available soon. Detection of protein catalytic residues at high precision using local network properties BMC Bioinformatics 2008, 9:517 doi:10.1186/1471-2105-9-517

