• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

HMMSTR: A hidden Markov model for local sequence-structure correlations in proteins (0)

by C Bystroff, V Thorsson, D Baker
Venue:J. Mol. Biol
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 129
Next 10 →

Review: Protein Secondary Structure Prediction Continues to Rise

by Burkhard Rost - J. Struct. Biol , 2001
"... f prediction accuracy? We shall see. 2001 Academic Press INTRODUCTION History. Linus Pauling correctly guessed the formation of helices and strands (14, 15) (and falsely hypothesized other structures). Three years before Pauling's guess was verified by the publications of the first X-ray stru ..."
Abstract - Cited by 180 (22 self) - Add to MetaCart
f prediction accuracy? We shall see. 2001 Academic Press INTRODUCTION History. Linus Pauling correctly guessed the formation of helices and strands (14, 15) (and falsely hypothesized other structures). Three years before Pauling's guess was verified by the publications of the first X-ray structures (16, 17), one group had already ventured to predict secondary structure from sequence (18). The first-generation prediction methods following in the 1960s and 1970s were all based on single amino acid propensities (19). The second-generation methods dominating the scene until the early 1990s used propensities for segments of 3--51 adjacent residues (19). Basically any imaginable theoretical algorithm had been applied to the problem of predicting secondary structure from sequence. However, it seemed that prediction accuracy stalled at levels slightly above 60% (percentage of residues predicted correctly in one of the three states: helix, strand, and other). The reason for this limit was the
(Show Context)

Citation Context

...ure ambivalent in response to global changes (1); DSSP, database and method converting 3D coordinates into secondary structure (2); HMMSTR, hidden Markov model-based prediction of secondary structure =-=(3)-=-; JPred, method combining other prediction methods (4, 5); JPred2, divergent profile (PSI-BLAST)-based neural network prediction (6); PHD, simple profile-based neural network prediction (7); PHDpsi, d...

Hidden markov models that use predicted local structure for fold recognition: alphabets of backbone geometry

by Rachel Karchin, Yael Mandel-gutfreund, Melissa Cline, Kevin Karplus - Proteins , 2003
"... An important problem in computational biology is predicting the structure of the large number of pu-tative proteins discovered by genome sequencing projects. Fold-recognition methods attempt to solve the problem by relating the target proteins to known structures, searching for template proteins hom ..."
Abstract - Cited by 70 (11 self) - Add to MetaCart
An important problem in computational biology is predicting the structure of the large number of pu-tative proteins discovered by genome sequencing projects. Fold-recognition methods attempt to solve the problem by relating the target proteins to known structures, searching for template proteins homologous to the target. Remote homologs which may have significant structural similarity are often not detectable by sequence similarities alone. To address this, we incorporated predicted local structure, a generalization of secondary structure, into two-track profile HMMs. We did not rely on a simple helix-strand-coil definition of secondary structure,
(Show Context)

Citation Context

...t provides little information about the coil category that accounts for 45% of protein structure on average [8]. Although there are many methods for defining fine-grained alphabets of local structure =-=[9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23]-=-, there has not been much work exploring whether these alphabets can be used to improve fold recognition or alignments. There have been previous attempts to improve fold recognition and target-templat...

A hidden Markov model derived structural alphabet for proteins

by A. C. Camproux, P. Tufféry - J Mol Biol , 2004
"... Understanding and predicting protein structures depend on the complexity and the accuracy of the models used to represent them. We have recently set up a Hidden Markov Model to optimally compress protein three-dimensional conformations into a one-dimensional series of letters of a structural alphabe ..."
Abstract - Cited by 51 (10 self) - Add to MetaCart
Understanding and predicting protein structures depend on the complexity and the accuracy of the models used to represent them. We have recently set up a Hidden Markov Model to optimally compress protein three-dimensional conformations into a one-dimensional series of letters of a structural alphabet. Such a model learns simultaneously the shape of representative structural letters describing the local conformation and the logic of their connections, i.e. the transition matrix between the letters. Here, we move one step further and report some evidence that such a model of protein local architecture also captures some accurate amino acid features. All the letters have specific and distinct amino acid distributions. Moreover, we show that words of amino acids can have significant propensities for some letters. Perspectives point towards the prediction of the series of letters describing the structure of a protein from its amino acid sequence. D 2005 Elsevier B.V. All rights reserved.
(Show Context)

Citation Context

...n specific of each letter of the structural alphabet exists and that we have not over-split sequence information. In terms of protein structure prediction, simple local libraries, such as the I-sites =-=[27,28]-=-, introduced as local constraints on protein available conformations, have shown their usefulness for improving ab initio prediction [29,30]. Our results tend to prove that accurate HMM-SA is well sui...

EVA: Large-Scale Analysis of Secondary Structure Prediction

by Burkhard Rost, Volker A. Eyrich , 2001
"... EVAisaweb-basedserverthat evaluatesautomaticstructurepredictionservers continuouslyandobjectively.SinceJune2000,EVA collectedmorethan20,000secondarystructurepredictions. TheEVAsetssufficedtoconcludethatthe fieldofsecondarystructurepredictionhasadvancedagain. Accuracyincreasedsubstantiallyin the1990s ..."
Abstract - Cited by 49 (10 self) - Add to MetaCart
EVAisaweb-basedserverthat evaluatesautomaticstructurepredictionservers continuouslyandobjectively.SinceJune2000,EVA collectedmorethan20,000secondarystructurepredictions. TheEVAsetssufficedtoconcludethatthe fieldofsecondarystructurepredictionhasadvancedagain. Accuracyincreasedsubstantiallyin the1990sthroughusingevolutionaryinformation takenfromthedivergenceofproteinsinthesame structuralfamily.Recently,theevolutionaryinformationresultingfromimprovedsearchesandlarger databaseshasagainboostedpredictionaccuracyby morethan4%toitscurrentheightaround76%ofall residuespredictedcorrectlyinoneofthethree states:helix,strand,orother.Thebestcurrent methodssolvedmostoftheproblemsraisedat earlierCASPmeetings:Allgoodmethodsnowget segmentsrightandperformwellonstrands.Isthe recentincreaseinaccuracysignificantenoughto makepredictionsevenmoreuseful?Webelievethe answerisaffirmative.Whatisthelimitofprediction accuracy?Weshallsee.Alldataareavailable throughtheEVAwebsiteat{cubic.bioc.columbia. edu/eva/}.Therawdatafortheresultspresentedare availableat{eva}/sec/bup_common/2001_02_22/. Proteins2001;Suppl5:192--199.2002Wiley-Liss,Inc. Keywords:automaticevaluation;large-scaleassessment; proteinstructureprediction

Predicting interresidue contacts using templates and pathways

by Yu Shao, Christopher Bystroff - Proteins , 2003
"... ABSTRACT We present a novel method, HMMSTR-CM, for protein contact map predictions. Contact potentials were calculated by using HMMSTR, a hidden Markov model for local sequence structure correlations. Targets were aligned against protein templates using a Bayesian method, and contact maps were gener ..."
Abstract - Cited by 47 (5 self) - Add to MetaCart
ABSTRACT We present a novel method, HMMSTR-CM, for protein contact map predictions. Contact potentials were calculated by using HMMSTR, a hidden Markov model for local sequence structure correlations. Targets were aligned against protein templates using a Bayesian method, and contact maps were generated by using these alignments. Contact potentials then were used to evaluate these templates. An ab initio method based on the target contact potentials using a rule-based strategy to model the protein-folding pathway was developed. Fold recognition and ab initio methods were combined to produce accurate, protein-like contact maps. Pathways sometimes led to an unambiguous prediction of topology, even without using templates. The results on CASP5 targets are discussed. Also included is a brief update on the quality of fully automated ab initio predictions using the I-sites server. Proteins 2003;53:497–502.

An HMM posterior decoder for sequence feature prediction that includes homology information

by Lukas Käll, Anders Krogh, Erik L. Sonnhammer , 2005
"... Motivation: When predicting sequence features like transmembrane topology, signal peptides, coil--coil structures, protein secondary structure or genes, extra support can be gained from homologs. ..."
Abstract - Cited by 45 (3 self) - Add to MetaCart
Motivation: When predicting sequence features like transmembrane topology, signal peptides, coil--coil structures, protein secondary structure or genes, extra support can be gained from homologs.

Ab initio construction of polypeptide fragments: accuracy of loop decoy discrimination by an all-atom statistical potential and the AMBER force field with the Generalized Born solvation model

by Mark A. Depristo, Paul I. W. De Bakker, Simon C. Lovell, Tom L. Blundell - Proteins , 2003
"... ABSTRACT We describe a novel method to generate ensembles of conformations of the main-chain atoms {N, C�,C,O,C�} for a sequence of amino acids within the context of a fixed protein framework. Each conformation satisfies fundamental stereochemical restraints such as idealized geometry, favorable �/ ..."
Abstract - Cited by 34 (4 self) - Add to MetaCart
ABSTRACT We describe a novel method to generate ensembles of conformations of the main-chain atoms {N, C�,C,O,C�} for a sequence of amino acids within the context of a fixed protein framework. Each conformation satisfies fundamental stereochemical restraints such as idealized geometry, favorable �/ � angles, and excluded volume. The ensembles include conformations both near and far from the native structure. Algorithms for effective conformational sampling and constant time overlap detection permit the generation of thousands of distinct conformations in minutes. Unlike previous approaches, our method samples dihedral angles from fine-grained �/ � state sets, which we demonstrate is superior to exhaustive enumeration from coarse �/ � sets. Applied to a large set of loop structures, our method samples consistently near-native conformations, averaging 0.4, 1.1, and 2.2 A ˚ mainchain root-mean-square deviations for four, eight, and twelve residue long loops, respectively. The ensembles make ideal decoy sets to assess the discriminatory power of a selection method. Using these decoy sets, we conclude that quality of anchor geometry cannot reliably identify near-native conformations, though the selection results are comparable to previous loop prediction methods. In a subsequent study (de Bakker et al.: Proteins 2003;51: 21–40), we demonstrate that the AMBER forcefield with the Generalized Born solvation model identifies near-native conformations significantly better than previous methods. Proteins 2003;51:41–55. © 2003 Wiley-Liss, Inc. Key words: conformational sampling; conformational search algorithms; anchor geometry; decoy sets; discrete state sets; loop modeling
(Show Context)

Citation Context

...n generation comparison [RMSD, A˚ ] Coarse-grained sets Fine-grained sets Length Park Rooman Deane Moult FREAD 9 18 36 72 2 1.47 (16) 0.71 (9) 0.85 (5) 0.61 (4) — 0.60 0.38 0.32 0.30 3 1.74 (24) 1.05 =-=(14)-=- 1.14 (9) 0.87 (5) 0.43 (3) 0.85 0.43 0.33 0.32 4 2.27 (26) 1.41 (7) 1.64 (5) 1.25 (1) 0.67 (3) 0.82 0.51 0.47 0.46 5 2.37 (22) 1.63 (9) 1.65 (5) 1.28 (3) 1.11 (6) 0.89 0.56 0.53 0.52 6 3.25 (14) 2.33...

SAM-T08, HMM-based Protein Structure Prediction

by Kevin Karplus , 2009
"... The SAM-T08 web server is a protein-structure prediction server that provides several useful intermediate results in addition to the final predicted 3D structure: three multiple sequence alignments of putative homologs using different iterated search procedures, prediction of local structure feature ..."
Abstract - Cited by 32 (4 self) - Add to MetaCart
The SAM-T08 web server is a protein-structure prediction server that provides several useful intermediate results in addition to the final predicted 3D structure: three multiple sequence alignments of putative homologs using different iterated search procedures, prediction of local structure features including various backbone and burial properties, calibrated E-values for the significance of template searches of PDB, and residue-residue contact predictions. The server has been validated as part of the CASP8 assessment of structure prediction as having good performance across all classes of predictions.

Striped sheets and protein contact prediction

by Robert M. Maccallum - Bioinformatics , 2004
"... Accepted for oral presentation at ISMB/ECCB 2004 ..."
Abstract - Cited by 31 (1 self) - Add to MetaCart
Accepted for oral presentation at ISMB/ECCB 2004
(Show Context)

Citation Context

...as been explored extensively in the literature, and most notably in the work relating to the I-sites library and HMMSTR local structure prediction tool (Han and Baker, 1996; Bystroff and Baker, 1998; =-=Bystroff et al., 2000-=-). It has been shown that sequence-structure correlations exist for different categories of helix, strand and turn, and also for supersecondary structure motifs. Although this area has been well studi...

Remote homolog detection using local sequence-structure correlations

by Yuna Hou, Wynne Hsu, Mong Li Lee, Christopher Bystroff - Proteins , 2004
"... ABSTRACT Remote homology detection refers to the detection of structural homology in proteins when there is little or no sequence similarity. In this article, we present a remote homolog detection method called SVM-HMMSTR that overcomes the reliance on detectable sequence similarity by transforming ..."
Abstract - Cited by 31 (1 self) - Add to MetaCart
ABSTRACT Remote homology detection refers to the detection of structural homology in proteins when there is little or no sequence similarity. In this article, we present a remote homolog detection method called SVM-HMMSTR that overcomes the reliance on detectable sequence similarity by transforming the sequences into strings of hidden Markov states that represent local folding motif patterns. These state strings are transformed into fixeddimension feature vectors for input to a support vector machine. Two sets of features are defined: an order-independent feature set that captures the amino acid and local structure composition; and an order-dependent feature set that captures the sequential ordering of the local structures. Tests using the Structural Classification of Proteins (SCOP) 1.53 data set show that the SVM-HMMSTR gives a significant improvement over several current methods. Proteins 2004;57:518–530. © 2004 Wiley-Liss, Inc. Key words: remote homology; local structure; support vector machines; hidden Markov model; protein folding; I-sites; HMMSTR
(Show Context)

Citation Context

...iffering topology. Hence, they do not contain sufficient information to define the overall, global fold of the protein molecule. Moreover, many of the I-sites motifs tend to overlap. Bystroff et. al. =-=[6]-=- develop a set of rules to define the propagation of structure along a protein chain that have been extracted from the database of known protein structures. This is formalized as a hidden Markov model...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University