Results 1 - 10
of
45
M: Beyond synexpression relationships: local clustering of time-shifted and inverted gene expression profiles identifies new, biologically relevant interactions
- J Mol Biol
"... The complexity of biological systems provides for a great diversity of relationships between genes. The current analysis of whole-genome expression data focuses on relationships based on global correlation over a whole time-course, identifying clusters of genes whose expression levels simultaneously ..."
Abstract
-
Cited by 45 (4 self)
- Add to MetaCart
The complexity of biological systems provides for a great diversity of relationships between genes. The current analysis of whole-genome expression data focuses on relationships based on global correlation over a whole time-course, identifying clusters of genes whose expression levels simultaneously rise and fall. There are, of course, other potential relationships between genes, which are missed by such global clustering. These include activation, where one expects a time-delay between related expression pro®les, and inhibition, where one expects an inverted relationship. Here, we propose a new method, which we call local clustering, for identifying these time-delayed and inverted relationships. It is related to conventional gene-expression clustering in a fashion analogous to the way local sequence alignment (the Smith-Waterman algorithm) is derived from global alignment (Needleman-Wunsch). An integral part of our method is the use of random score distributions to assess the statistical signi®cance of each cluster. We applied our method to the yeast cellcycle
Automatic Analysis of DNA Microarray Images Using Mathematical Morphology
- Bioinformatics
, 2003
"... Motivation: DNA microarrays are an experimental technology which consists in arrays of thousands of discrete DNA sequences that are printed on glass microscope slides. Image analysis is an important aspect of microarray experiments. The aim of this step is to reduce an image of spots into a table wi ..."
Abstract
-
Cited by 43 (1 self)
- Add to MetaCart
Motivation: DNA microarrays are an experimental technology which consists in arrays of thousands of discrete DNA sequences that are printed on glass microscope slides. Image analysis is an important aspect of microarray experiments. The aim of this step is to reduce an image of spots into a table with a measure of the intensity for each spot. Efficient, accurate and automatic analysis of DNA spot images is essential in order to use this technology in laboratory routines. Results: We present an automatic non-supervised set of algorithms for a fast and accurate spot data extraction from DNA microarrays using morphological operators which are robust to both intensity variation and artefacts. The approach can be summarised as follows. Initially, agridding algorithm yields the automatic segmentation of the microarray image into spot quadrants which are later individually analysed. Then the analysis of the spot quadrant images is achieved in five steps. First, a prequantification, the spot size distribution law is calculated. Second, the background noise extraction is performed using a morphological filtering by area. Third, an orthogonal grid provides the first approach to the spot locus. Fourth, the spot segmentation or spot boundaries definition is carried out using the watershed transformation. And fifth, the outline of detected spots allows the signal quantification or spot intensities extraction; in this respect, a noise model has been investigated. The performance of the algorithm has been compared with two packages: ScanAlyze and Genepix, showing its robustness and precision.
Microarray Results: How Accurate Are They?
- BMC Bioinformatics
, 2002
"... Background: DNA microarray technology is a powerful technique that was recently developed in order to analyze thousands of genes in a short time. Presently, microarrays, or chips, of the cDNA type and oligonucleotide type are available from several sources. The number of publications in this area ..."
Abstract
-
Cited by 26 (0 self)
- Add to MetaCart
Background: DNA microarray technology is a powerful technique that was recently developed in order to analyze thousands of genes in a short time. Presently, microarrays, or chips, of the cDNA type and oligonucleotide type are available from several sources. The number of publications in this area is increasing exponentially.
Iterative normalization of cDNA microarray data
- IEEE Trans Inf Technol Biomed
, 2002
"... Abstract—This paper describes a new approach to normalizing microarray expression data. The novel feature is to unify the tasks of estimating normalization coefficients and identifying control gene set. Unification is realized by constructing a window function over the scatter plot defining the subs ..."
Abstract
-
Cited by 11 (1 self)
- Add to MetaCart
Abstract—This paper describes a new approach to normalizing microarray expression data. The novel feature is to unify the tasks of estimating normalization coefficients and identifying control gene set. Unification is realized by constructing a window function over the scatter plot defining the subset of constantly expressed genes and by affecting optimization using an iterative procedure. The structure of window function gates contributions to the control gene set used to estimate normalization coefficients. This window measures the consistency of the matched neighborhoods in the scatter plot and provides a means of rejecting control gene outliers. The recovery of normalizational regression and control gene selection are interleaved and are realized by applying coupled operations to the mean square error function. In this way, the two processes bootstrap one another. We evaluate the technique on real microarray data from breast cancer cell lines and complement the experiment with a data cluster visualization study. Index Terms—Data normalization, dynamic programming, gene expression, gene microarray, linear regression. I.
Genome-wide analysis of mRNA stability using transcription inhibitors and microarrays reveals posttranscriptional control of ribosome biogenesis factors
- Mol. Cell. Biol
, 2004
"... Using DNA microarrays, we compared global transcript stability profiles following chemical inhibition of transcription to rpb1-1 (a temperature-sensitive allele of yeast RNA polymerase II). Among the five inhibitors tested, the effects of thiolutin and 1,10-phenanthroline were most similar to rpb1-1 ..."
Abstract
-
Cited by 8 (0 self)
- Add to MetaCart
Using DNA microarrays, we compared global transcript stability profiles following chemical inhibition of transcription to rpb1-1 (a temperature-sensitive allele of yeast RNA polymerase II). Among the five inhibitors tested, the effects of thiolutin and 1,10-phenanthroline were most similar to rpb1-1. A comparison to various microarray data already in the literature revealed similarity between mRNA stability profiles and the transcriptional response to stresses such as heat shock, consistent with the fact that the general stress response includes a transient shutoff of general mRNA transcription. Genes encoding factors involved in rRNA synthesis and ribosome assembly, which are often observed to be coordinately down-regulated in yeast microarray data, were among the least stable transcripts. We examined the effects of deletions of genes encoding deadenylase components Ccr4p and Pan2p and putative RNA-binding proteins Pub1p and Puf4p on the genome-wide pattern of mRNA stability after inhibition of transcription by chemicals and/or heat stress. This examination showed that Ccr4p, the major yeast mRNA deadenylase, contributes to the degradation of transcripts encoding both ribosomal proteins and rRNA synthesis and ribosome assembly factors and mediates a large part of the transcriptional response to heat stress. Pan2p and Puf4p also contributed to the degradation rate of these mRNAs following transcriptional shutoff, while Pub1p preferentially stabilized transcripts encoding ribosomal proteins. Our results indicate that the abundance of ribosome biogenesis factors is controlled at the level of
Extracting gene networks for low-dose radiation using graph theoretical algorithms, PLoS
- Computational Biology
, 2006
"... Genes with common functions often exhibit correlated expression levels, which can be used to identify sets of interacting genes from microarray data. Microarrays typically measure expression across genomic space, creating a massive matrix of co-expression that must be mined to extract only the most ..."
Abstract
-
Cited by 6 (0 self)
- Add to MetaCart
Genes with common functions often exhibit correlated expression levels, which can be used to identify sets of interacting genes from microarray data. Microarrays typically measure expression across genomic space, creating a massive matrix of co-expression that must be mined to extract only the most relevant gene interactions. We describe a graph theoretical approach to extracting co-expressed sets of genes, based on the computation of cliques. Unlike the results of traditional clustering algorithms, cliques are not disjoint and allow genes to be assigned to multiple sets of interacting partners, consistent with biological reality. A graph is created by thresholding the correlation matrix to include only the correlations most likely to signify functional relationships. Cliques computed from the graph correspond to sets of genes for which significant edges are present between all members of the set, representing potential members of common or interacting pathways. Clique membership can be used to infer function about poorly annotated genes, based on the known functions of better-annotated genes with which they share clique membership (i.e., ‘‘guilt-by-association’’). We illustrate our method by applying it to microarray data collected from the spleens of mice exposed to low-dose ionizing radiation. Differential analysis is used to identify sets of genes whose interactions are impacted by radiation exposure. The correlation graph is also queried independently of clique to extract edges that are impacted by radiation. We present several examples of multiple gene interactions that are altered by radiation
Mapping of transcription factor binding regions in mammalian cells by ChIP: comparison of array- and sequencing-based technologies. Genome Res 17
, 2007
"... transcription factor binding regions Mapping of transcription factor (TF) binding regions has provided tremendous insight into our understanding of gene expression regulatory networks. Recent progress in this field can largely be credited to the application of chromatin immunoprecipitation (ChIP) te ..."
Abstract
-
Cited by 5 (1 self)
- Add to MetaCart
transcription factor binding regions Mapping of transcription factor (TF) binding regions has provided tremendous insight into our understanding of gene expression regulatory networks. Recent progress in this field can largely be credited to the application of chromatin immunoprecipitation (ChIP) technologies. We compared strategies for mapping TF binding regions in mammalian cells using two different ChIP schemes: ChIP followed by DNA microarray analysis (ChIP-Chip) and ChIP followed by DNA sequencing (ChIP-PET). In these studies we first investigated parameters central to obtaining robust ChIP-chip datasets through the analysis of STAT1 targets in the ENCODE-designated regions of the human genome, and then compared ChIP-chip to ChIP-PET. We devised methods for scoring and comparing results among various tiling arrays and examined parameters such as DNA microarray format (oligonucleotide or PCR product elements), oligonucleotide length, hybridization conditions, and the use of competitor Cot-1 DNA to determine which among these variables enhances ChIP-chip performance in the detection of TF binding regions. The best performance was achieved with high density
An evaluation of a system that recommends microarray experiments to perform to discover gene-regulation pathways
- Journal Artificial Intelligence in Medicine
, 2003
"... The main topic of this paper is modeling the expected value of experimentation for discovering causal pathways in gene expression data. By experimentation we mean both interventions (e.g., a gene knock-out experiment) and observations (e.g., passively observing the expression level of a “wild-type ” ..."
Abstract
-
Cited by 4 (0 self)
- Add to MetaCart
The main topic of this paper is modeling the expected value of experimentation for discovering causal pathways in gene expression data. By experimentation we mean both interventions (e.g., a gene knock-out experiment) and observations (e.g., passively observing the expression level of a “wild-type ” gene). We introduce a system called GEEVE (causal discovery in Gene Expression data using Expected Value of Experimentation), which implements expected value of experimentation in discovering causal pathways using gene expression data. GEEVE provides the following assistance, which is intended to help biologists in their quest to discover gene-regulation pathways: • Recommending which experiments to perform (with a focus on “knock-out ” experiments) using an expected value of experimentation (EVE) method. • Recommending the number of measurements (observational and experimental) to include in the experimental design, again using an EVE method. • Providing a Bayesian analysis that combines prior knowledge with the results of recent microarray experimental results to derive posterior probabilities of gene regulation
A two-way semi-linear model for normalization and significant analysis of cDNA microarray data
- Jour. Ameri. Statist.Assoc.,100,814-829
, 2005
"... 1 ABSTRACT A basic question in analyzing cDNA microarray data is normalization. The purpose of normalization is to remove systematic bias in the observed expression values by establishing a normalization curve across the whole dynamic range. A proper normalization procedure ensures that the normaliz ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
1 ABSTRACT A basic question in analyzing cDNA microarray data is normalization. The purpose of normalization is to remove systematic bias in the observed expression values by establishing a normalization curve across the whole dynamic range. A proper normalization procedure ensures that the normalized intensity ratios provide meaningful measures of relative expression levels. We propose a two-way semi-linear model (TW-SLM) for normalization and significant analysis of microarray data. This method does not make the usual assumptions underlying some of the existing methods. For example, it does not assume that: (i) the percentage of differentially expressed genes is small; or (ii) there is symmetry in the expression levels of up- and down-regulated genes, as required in the lowess normalization method. The TW-SLM also naturally incorporates uncertainty due to normalization into significant analysis of microarrays. We use a semiparametric approach based on polynomial splines in the TW-SLM to estimate the normalization curves and the normalized expression values. We also conduct simulation studies to evaluate the TW-SLM method and illustrate the proposed method using a published microarray data set. KEY WORDS: differentially expressed genes; microarray; high-dimensional data; semiparametric regression; spline; analysis of variance; noise level; variance estimation. 2 1
Probabilistic Segmentation and Intensity Estimation for Microarray Images
, 2005
"... We describe a probabilistic approach to simultaneous image segmentation and intensity estimation for cDNA microarray experiments. The approach overcomes several limitations of existing methods. In particular it a) uses a flexible Markov random field approach to segmentation that allows for a wider r ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
We describe a probabilistic approach to simultaneous image segmentation and intensity estimation for cDNA microarray experiments. The approach overcomes several limitations of existing methods. In particular it a) uses a flexible Markov random field approach to segmentation that allows for a wider range of spot shapes than existing methods, including relatively-common ”doughnut-shaped ” spots; b) models the image directly as background plus hybridization intensity, and estimates the two quantities simultaneously, avoiding the common logical error that estimates of foreground may be less than those of corresponding background if the two are estimated separately; c) uses a probabilistic modelling approach to simultaneously perform segmentation and intensity estimation, and to compute spot quality measures. We describe two approaches to parameter estimation: a fast algorithm, based on the Expectation-Maximisation (EM) and the Iterated Conditional Modes (ICM) algorithms, and a fully Bayesian framework. These approaches produce comparable results, and both appear to offer some advantages over other methods. We use an HIV experiment to compare our approach to two commercial software products: Spot and Arrayvision.

