Results 1 - 10
of
59
M: Beyond synexpression relationships: local clustering of time-shifted and inverted gene expression profiles identifies new, biologically relevant interactions
- J Mol Biol
"... The complexity of biological systems provides for a great diversity of relationships between genes. The current analysis of whole-genome expression data focuses on relationships based on global correlation over a whole time-course, identifying clusters of genes whose expression levels simultaneously ..."
Abstract
-
Cited by 46 (4 self)
- Add to MetaCart
The complexity of biological systems provides for a great diversity of relationships between genes. The current analysis of whole-genome expression data focuses on relationships based on global correlation over a whole time-course, identifying clusters of genes whose expression levels simultaneously rise and fall. There are, of course, other potential relationships between genes, which are missed by such global clustering. These include activation, where one expects a time-delay between related expression pro®les, and inhibition, where one expects an inverted relationship. Here, we propose a new method, which we call local clustering, for identifying these time-delayed and inverted relationships. It is related to conventional gene-expression clustering in a fashion analogous to the way local sequence alignment (the Smith-Waterman algorithm) is derived from global alignment (Needleman-Wunsch). An integral part of our method is the use of random score distributions to assess the statistical signi®cance of each cluster. We applied our method to the yeast cellcycle
Functional modules by relating protein interaction networks and gene expression
- Nucleic Acids Res
, 2003
"... gene expression ..."
What is bioinformatics? A proposed definition and overview of the field
"... BACKGROUND: The recent flood of data from genome sequencing and functional genomics has given rise to new field, bioinformatics, which combines elements of biology and computer science. OBJECTIVES: Here we propose a definition for this new field and review some the research that is being pursued, p ..."
Abstract
-
Cited by 24 (2 self)
- Add to MetaCart
BACKGROUND: The recent flood of data from genome sequencing and functional genomics has given rise to new field, bioinformatics, which combines elements of biology and computer science. OBJECTIVES: Here we propose a definition for this new field and review some the research that is being pursued, particularly in relation to transcriptional regulatory systems. METHODS: Our definition is as follows: Bioinformatics is conceptualizing biology in terms of macromolecules (in the sense of physical-chemistry) and then applying "informatics" techniques (derived from disciplines such as applied maths, computer science, and statistics) to understand and organize the information associated with these molecules, on a large-scale. RESULTS & CONCLUSIONS: Analyses in bioinformatics predominantly focus on three types of large datasets available in molecular biology: macromolecular structures, genome sequences, and the results of functional genomics experiments (eg expression data). Additional information includes the text of scientific papers and "relationship data" from metabolic pathways, taxonomy trees, and proteinprotein interaction networks. Bioinformatics employs a wide range of computational topics including sequence and structural alignment, database design and data mining, macromolecular geometry, phylogenetic tree construction, prediction of protein structure and function, gene finding, and expression data clustering. The emphasis is on approaches that integrate a variety of computational techniques and heterogeneous data sources. Finally, bioinformatics is a practical discipline. We survey some representative applications, such as finding homologues, designing drugs, and performing large-scale censuses. Additional information pertinent to the review is available over the w...
Assessing the limits of genomic data integration for predicting protein networks. Genome Res
, 2005
"... All genomic feature data used in this study can be downloaded at ..."
Abstract
-
Cited by 24 (2 self)
- Add to MetaCart
All genomic feature data used in this study can be downloaded at
Toward a comprehensive atlas of the physical interactome of Saccharomyces cerevisiae
- Mol. Cell. Proteomics
, 2007
"... Defining protein complexes is critical to virtually all aspects of cell biology. Two recent affinity purification/mass spectrometry studies in Saccharomyces cerevisiae have vastly increased the available protein interaction data. The practical utility of such high throughput interaction sets, howeve ..."
Abstract
-
Cited by 15 (0 self)
- Add to MetaCart
Defining protein complexes is critical to virtually all aspects of cell biology. Two recent affinity purification/mass spectrometry studies in Saccharomyces cerevisiae have vastly increased the available protein interaction data. The practical utility of such high throughput interaction sets, however, is substantially decreased by the presence of false positives. Here we created a novel probabilistic metric that takes advantage of the high density of these data, including both the presence and absence of individual associations, to provide a measure of the relative confidence of each potential protein-protein interaction. This analysis largely overcomes the noise inherent in high throughput immunoprecipitation experiments. For example, of the 12,122 binary interactions in the general repository of interaction data (BioGRID) derived from these two
The importance of bottlenecks in protein networks: Correlation with gene essentiality and expression dynamics
- PLoS Comput. Biol
, 2007
"... It has been a long-standing goal in systems biology to find relations between the topological properties and functional features of protein networks. However, most of the focus in network studies has been on highly connected proteins (‘‘hubs’’). As a complementary notion, it is possible to define bo ..."
Abstract
-
Cited by 10 (1 self)
- Add to MetaCart
It has been a long-standing goal in systems biology to find relations between the topological properties and functional features of protein networks. However, most of the focus in network studies has been on highly connected proteins (‘‘hubs’’). As a complementary notion, it is possible to define bottlenecks as proteins with a high betweenness centrality (i.e., network nodes that have many ‘‘shortest paths’ ’ going through them, analogous to major bridges and tunnels on a highway map). Bottlenecks are, in fact, key connector proteins with surprising functional and dynamic properties. In particular, they are more likely to be essential proteins. In fact, in regulatory and other directed networks, betweenness (i.e., ‘‘bottleneck-ness’’) is a much more significant indicator of essentiality than degree (i.e., ‘‘hub-ness’’). Furthermore, bottlenecks correspond to the dynamic components of the interaction network—they are significantly less well coexpressed with their neighbors than nonbottlenecks, implying that expression dynamics is wired into the network topology.
Predicting Protein-Protein Interactions from Protein Domains Using a Set Cover Approach
, 2004
"... The goal of contemporary proteome research is the elucidation of protein interactions in the cell. Based on currently available protein-protein interaction and domain data of S. cerevisiae, we introduce a novel method, Maximum Specificity Set Cover (MSSC), to predict protein-protein interactions. ..."
Abstract
-
Cited by 10 (5 self)
- Add to MetaCart
The goal of contemporary proteome research is the elucidation of protein interactions in the cell. Based on currently available protein-protein interaction and domain data of S. cerevisiae, we introduce a novel method, Maximum Specificity Set Cover (MSSC), to predict protein-protein interactions. Our approach features two stages: First, we select high quality protein interactions based on a clustering measure. Second, we use MSSC to assign probabilities to domain pairs. This approach allows us to predict previously unknown protein-protein interactions with a degree of sensitivity and specificity that clearly outscores other approaches. We achieve 86% sensitivity and 62% specificity using 80% of the high quality interactions in the DIP database. We find that the predicted interaction network preserves the characteristics of the initial web of known protein interactions. We also observe high levels of co-expression among putative interactions.
What is bioinformatics? An introduction and overview
, 2001
"... A flood of data means that many of the challenges in biology are now challenges in computing. Bioinformatics, the application of computational techniques to analyse the information associated with biomolecules on a large-scale, has now firmly established itself as a discipline in molecular biology, ..."
Abstract
-
Cited by 9 (0 self)
- Add to MetaCart
A flood of data means that many of the challenges in biology are now challenges in computing. Bioinformatics, the application of computational techniques to analyse the information associated with biomolecules on a large-scale, has now firmly established itself as a discipline in molecular biology, and encompasses a wide range of subject areas from structural biology, genomics to gene expression studies. In this review we provide an introduction and overview of the current state of the field. We discuss the main principles that underpin bioinformatics analyses, look at the types of biological information and databases that are commonly used, and finally examine some of the studies that are being conducted, particularly with reference to transcription regulatory systems. 2. Introduction Biological data are flooding in at an unprecedented rate (1). For example as of August 2000, the GenBank repository of nucleic acid sequences contained 8,214,000 entries (2) and the SWISS-PROT databas...
Ontologies for proteomics - Towards a systematic definition of structure & function that scales to the genome level
"... A principle aim of structural and functional genomics is to elucidate the structures and functions of all the gene products in the genome. However, to adequately comprehend and analyze such a large amount of information we need new descriptions of proteins that scale to the genomic level. In shor ..."
Abstract
-
Cited by 9 (4 self)
- Add to MetaCart
A principle aim of structural and functional genomics is to elucidate the structures and functions of all the gene products in the genome. However, to adequately comprehend and analyze such a large amount of information we need new descriptions of proteins that scale to the genomic level. In short, we need a unified ontology for proteomics. Here we review progress towards this end, surveying the diverse approaches to systematic structural and functional classification and their progress towards developing standardized, unified descriptions for proteins. We focus particularly on systems to organize protein properties (both biophysical and biochemical) - as opposed to the classification of 3D protein folds, a subject has been reviewed extensively elsewhere.
ExpressYourself: A modular platform for processing and visualizing microarray data. Nucleic Acids Res
, 2003
"... DNA microarrays are widely used in biological research; by analyzing differential hybridization on a single microarray slide, one can detect changes in mRNA expression levels, increases in DNA copy numbers and the location of transcription factor binding sites on a genomic scale. Having performed th ..."
Abstract
-
Cited by 9 (5 self)
- Add to MetaCart
DNA microarrays are widely used in biological research; by analyzing differential hybridization on a single microarray slide, one can detect changes in mRNA expression levels, increases in DNA copy numbers and the location of transcription factor binding sites on a genomic scale. Having performed the experiments, the major challenge is to process large, noisy datasets in order to identify the specific array elements that are significantly differentially hybridized. This normally requires aggregating different, often incompatible programs into a multistep pipeline. Here we present ExpressYourself, a fully integrated platform for processing microarray data. In completely automated fashion, it will correct the background array signal, normalize the Cy5 and Cy3 signals, score levels of differential hybridization, combine the results of replicate experiments, filter problematic regions of the array and assess the quality of individual and replicate experiments. ExpressYourself is designed with a highly modular architecture so various types of microarray analysis algorithms can readily be incorporated as they are developed; for example, the system currently implements several normalization methods, including those that simultaneously consider signal intensity and slide location. The processed data are presented using a web-based graphical interface to facilitate comparison with the original images of the array slides. In particular, Express Yourself is able to regenerate images of the original microarray after applying various steps of processing, which greatly facilities identification of position-specific artifacts. The program is freely available for use at

