• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Knowledge-based protein secondary structure assignment. Proteins (1995)

by D Frishman, P Argos
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 324
Next 10 →

Can correct protein models be identified

by Björn Wallner, Arne Elofsson - Protein Sci , 2003
"... The ability to separate correct models of protein structures from less correct models is of the greatest importance for protein structure prediction methods. Several studies have examined the ability of different types of energy function to detect the native, or native-like, protein structure from a ..."
Abstract - Cited by 77 (4 self) - Add to MetaCart
The ability to separate correct models of protein structures from less correct models is of the greatest importance for protein structure prediction methods. Several studies have examined the ability of different types of energy function to detect the native, or native-like, protein structure from a large set of decoys. In contrast to earlier studies, we examine here the ability to detect models that only show limited structural similarity to the native structure. These correct models are defined by the existence of a fragment that shows significant similarity between this model and the native structure. It has been shown that the existence of such fragments is useful for comparing the performance between different fold recognition methods and that this performance correlates well with performance in fold recognition. We have developed ProQ, a neuralnetwork-based method to predict the quality of a protein model that extracts structural features, such as frequency of atom–atom contacts, and predicts the quality of a model, as measured either by LGscore or MaxSub. We show that ProQ performs at least as well as other measures when identifying the native structure and is better at the detection of correct models. This performance is maintained over several different test sets. ProQ can also be combined with the Pcons fold recognition predictor (Pmodeller) to increase its performance, with the main advantage being the elimination of a few high-scoring incorrect models. Pmodeller was successful in CASP5 and results from the latest LiveBench, LiveBench-6, indicating that Pmodeller has a higher specificity than Pcons alone.
(Show Context)

Citation Context

... exposed. The exposure data were grouped into one of the four groups, <25%, 25%–50%, 50%–75%, and >75% exposed, and finally normalized by the number of residues. Predicted secondary structure STRIDE (=-=Frishman and Argos 1995-=-) was used to assign secondary structure to the protein models based on its coordinates. Each residue was assigned to one of the three classes, helix, sheet, or coil. This assignment was compared with...

Hidden markov models that use predicted local structure for fold recognition: alphabets of backbone geometry

by Rachel Karchin, Yael Mandel-gutfreund, Melissa Cline, Kevin Karplus - Proteins , 2003
"... An important problem in computational biology is predicting the structure of the large number of pu-tative proteins discovered by genome sequencing projects. Fold-recognition methods attempt to solve the problem by relating the target proteins to known structures, searching for template proteins hom ..."
Abstract - Cited by 70 (11 self) - Add to MetaCart
An important problem in computational biology is predicting the structure of the large number of pu-tative proteins discovered by genome sequencing projects. Fold-recognition methods attempt to solve the problem by relating the target proteins to known structures, searching for template proteins homologous to the target. Remote homologs which may have significant structural similarity are often not detectable by sequence similarities alone. To address this, we incorporated predicted local structure, a generalization of secondary structure, into two-track profile HMMs. We did not rely on a simple helix-strand-coil definition of secondary structure,
(Show Context)

Citation Context

...t provides little information about the coil category that accounts for 45% of protein structure on average [8]. Although there are many methods for defining fine-grained alphabets of local structure =-=[9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23]-=-, there has not been much work exploring whether these alphabets can be used to improve fold recognition or alignments. There have been previous attempts to improve fold recognition and target-templat...

STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins

by Matthias Heinig, Dmitrij Frishman - Nucleic Acids Res , 2004
"... assignment from known atomic coordinates of proteins ..."
Abstract - Cited by 66 (1 self) - Add to MetaCart
assignment from known atomic coordinates of proteins
(Show Context)

Citation Context

...ation of backbone curvature using differential geometry. More recently, an improved version of DSSP, called DSSPcont, has been developed which takes into account the structural variations in proteins =-=(5)-=-. Our method, STRIDE (6), was developed with a specific goal to accurately reproduce secondary structure designations created by human experts. It is thus a knowledge-based approach which uses, as tra...

Cyclic coordinate descent: A robotics algorithm for protein loop closure, Protein Sci

by Adrian A. Canutescu, L. Dunbrack, Email Alerting, Adrian A. Canutescu, Roland L. Dunbrack
"... service This article cites 34 articles, 12 of which can be accessed free at: ..."
Abstract - Cited by 65 (0 self) - Add to MetaCart
service This article cites 34 articles, 12 of which can be accessed free at:
(Show Context)

Citation Context

...he PISCES server (formerly the CulledPDB server, now available at http://www.fccc.edu/research/ labs/dunbrack/pisces). The chosen loops were identified as having coil structure by the Stride program (=-=Frishman and Argos 1995-=-). None of the loops in the test set are adjacent to disordered residues as determined by our validation program S2C (http://www. fccc.edu/research/labs/dunbrack/s2c). Ramachandran probability maps We...

Large-Scale Comparison of Protein Sequence Alignment Algorithms With Structure Alignments

by J. Michael Sauder, Jonathan W. Arthur, Roland L. Dunbrack - Proteins , 2000
"... Sequence alignment programs such as BLAST and PSI-BLAST are used routinely in pairwise, profile-based, or intermediate-sequencesearch (ISS) methods to detect remote homologies for the purposes of fold assignment and comparative modeling. Yet, the sequence alignment quality of these methods at low se ..."
Abstract - Cited by 64 (2 self) - Add to MetaCart
Sequence alignment programs such as BLAST and PSI-BLAST are used routinely in pairwise, profile-based, or intermediate-sequencesearch (ISS) methods to detect remote homologies for the purposes of fold assignment and comparative modeling. Yet, the sequence alignment quality of these methods at low sequence identity is not known. We have used the CE structure alignment program (Shindyalov and Bourne, Prot Eng 1998;11: 739) to derive sequence alignments for all superfamily and family-level related proteins in the SCOP domain database. CE aligns structures and their sequences based on distances within each protein, rather than on interprotein distances. We compared BLAST, PSI-BLAST, CLUSTALW, and ISS alignments with the CE structural alignments. We found that global alignments with CLUSTALW were very poor at low sequence identity (&lt;25%), as judged by the CE alignments. We used PSI-BLAST to search the nonredundant sequence database (nr) with every sequence in SCOP using up to four iterations. The resulting matrix was used to search a database of SCOP sequences. PSI-BLAST is only slightly better than BLAST in alignment accuracy on a perresidue basis, but PSI-BLAST matrix alignments are much longer than BLAST&apos;s, and so align correctly a larger fraction of the total number of aligned residues in the structure alignments. Any two SCOP sequences in the same superfamily that shared a hit or hits in the nr PSI-BLAST searches were identified as linked by the shared intermediate sequence. We examined the quality of the longest SCOP-query/ SCOP-hit alignment via an intermediate sequence, and found that ISS produced longer alignments than PSI-BLAST searches alone, of nearly comparable per-residue quality. At 10--15% sequence identity, BLAST correctly aligns 28%, PSI-BLAST 40%, and ISS ...

Functional and structural genomics using PEDANT

by Dmitrij Frishman, Kaj Albermann, Jean Hani, Klaus Heumann, Agnes Metanomski, Alfred Zollner, Hans-werner Mewes , 2001
"... Motivation: Enormous demand for fast and accurate analysis of biological sequences is fuelled by the pace of genome analysis efforts. There is also an acute need in reliable up-to-date genomic databases integrating both functional and structural information. Here we describe the current status of th ..."
Abstract - Cited by 56 (11 self) - Add to MetaCart
Motivation: Enormous demand for fast and accurate analysis of biological sequences is fuelled by the pace of genome analysis efforts. There is also an acute need in reliable up-to-date genomic databases integrating both functional and structural information. Here we describe the current status of the PEDANT software system for highthroughput analysis of large biological sequence sets and the genome analysis server associated with it.

A hidden Markov model derived structural alphabet for proteins

by A. C. Camproux, P. Tufféry - J Mol Biol , 2004
"... Understanding and predicting protein structures depend on the complexity and the accuracy of the models used to represent them. We have recently set up a Hidden Markov Model to optimally compress protein three-dimensional conformations into a one-dimensional series of letters of a structural alphabe ..."
Abstract - Cited by 51 (10 self) - Add to MetaCart
Understanding and predicting protein structures depend on the complexity and the accuracy of the models used to represent them. We have recently set up a Hidden Markov Model to optimally compress protein three-dimensional conformations into a one-dimensional series of letters of a structural alphabet. Such a model learns simultaneously the shape of representative structural letters describing the local conformation and the logic of their connections, i.e. the transition matrix between the letters. Here, we move one step further and report some evidence that such a model of protein local architecture also captures some accurate amino acid features. All the letters have specific and distinct amino acid distributions. Moreover, we show that words of amino acids can have significant propensities for some letters. Perspectives point towards the prediction of the series of letters describing the structure of a protein from its amino acid sequence. D 2005 Elsevier B.V. All rights reserved.

EVA: evaluation of protein structure prediction servers

by Ingrid Y. Y. Koh, Volker A. Eyrich, Marc A. Marti-renom, Dariusz Przybylski, Mallur S. Madhusudhan, Narayanan Eswar, Osvaldo Graña, Florencio Pazos, Alfonso Valencia, Andrej Sali, Burkhard Rost - Nucleic Acids Research , 2003
"... EVA ..."
Abstract - Cited by 46 (11 self) - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

...nding C-beta atoms (C-alpha for Gly) with a 8 A˚ cut-off. EVA-sec. Evaluates protein secondary structure predictions. Secondary structures are assigned from 3D structures through DSSP (16) and STRIDE =-=(17)-=-. EVA-sec measures accuracy by: (i) per-residue accuracy (18) (Q3)—percentage of residues correctly predicted in one of the three states (helix, strand or other); (ii) per-segment accuracy (18,19) (SO...

VIPERdb2: an enhanced and web API enabled relational database for structural virology. Nucleic Acids Res

by Craig M. Shepherd, Ian A. Borelli, Sangita Venkataraman, Gabriel L, Padmaja Natarajan, John E. Johnson, Charles L. Brooks Iii, Vijay S. Reddy , 2009
"... structural virology ..."
Abstract - Cited by 38 (0 self) - Add to MetaCart
structural virology
(Show Context)

Citation Context

... surface areas (2), (iv) Q-scores, quantitative measure of quasi-equivalence (5) and finally links to the oligomer generator, map-a-residue and secondary structure information calculated using STRIDE =-=(18)-=-. The related viruses tab provides links to all the related members in the same genus and family of the selected entry. WEB APPLICATION PROGRAMMING INTERFACE In order to expand the interoperability of...

Distinguishing enzyme structures from non-enzymes without alignments

by Paul D. Dobson, Andrew J. Doig - J. Mol. Biol , 2003
"... The ability to predict protein function from structure is becoming increasingly important as the number of structures resolved is growing more rapidly than our capacity to study function. Current methods for predicting protein function are mostly reliant on identifying a similar protein of known fun ..."
Abstract - Cited by 35 (1 self) - Add to MetaCart
The ability to predict protein function from structure is becoming increasingly important as the number of structures resolved is growing more rapidly than our capacity to study function. Current methods for predicting protein function are mostly reliant on identifying a similar protein of known function. For proteins that are highly dissimilar or are only similar to proteins also lacking functional annotations, these methods fail. Here, we show that protein function can be predicted as enzymatic or not without resorting to alignments. We describe 1178 high-resolution proteins in a structurally non-redundant subset of the Protein Data Bank using simple features such as secondary-structure content, amino acid propensities, surface properties and ligands. The subset is split into two functional groupings, enzymes and non-enzymes. We use the support vector machine-learning algorithm to develop models that are capable of assigning the protein class. Validation of the method shows that the function can be predicted to an accuracy of 77 % using 52 features to describe each protein. An adaptive search of possible subsets of features produces a simplified model based on 36 features that predicts at an accuracy of 80%. We compare the method to sequence-based methods that also avoid calculating alignments and predict a recently released set of unrelated proteins. The most useful features for distinguishing enzymes from nonenzymes are secondary-structure content, amino acid frequencies, number of disulphide bonds and size of the largest cleft. This method is applicable to any structure as it does not require the identification of sequence or structural similarity to a protein of known function.
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University