• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Normalization and analysis of DNA microarray data by self-consistency and local regression (0)

by T B Kepler, L Crosby, K T Morgan
Venue:Genome Biology
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 28
Next 10 →

Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation

by Yee Hwa Yang, Sandrine Dudoit, Percy Luu, Vivian Peng , 2002
"... There are many sources of systematic variation in cDNA microarray experiments which affect the measured gene expression levels (e.g. differences in labeling efficiency between the two fluorescent dyes). The term normalization refers to the process of removing such variation. A constant adjustment is ..."
Abstract - Cited by 194 (3 self) - Add to MetaCart
There are many sources of systematic variation in cDNA microarray experiments which affect the measured gene expression levels (e.g. differences in labeling efficiency between the two fluorescent dyes). The term normalization refers to the process of removing such variation. A constant adjustment is often used to force the distribution of the intensity log ratios to have a median of zero for each slide. However, such global normalization approaches are not adequate in situations where dye biases can depend on spot overall intensity and/or spatial location within the array. This article proposes normalization methods that are based on robust local regression and account for intensity and spatial dependence in dye biases for different types of cDNA microarray experiments. The selection of appropriate controls for normalization is discussed and a novel set of controls (microarray sample pool, MSP) is introduced to aid in intensity-dependent normalization. Lastly, to allow for comparisons of expression levels across slides, a robust method based on maximum likelihood estimation is proposed to adjust for scale differences among slides.

Statistical Issues in cDNA Microarray Data Analysis

by Gordon K. Smyth, Yee Hwa Yang, Terry Speed , 2003
"... This article summarizes some of the issues involved and provides a brief review of the analysis tools which are available to researchers to deal with them. Any microarray experiment involves a number of distinct stages. Firstly there is the design of the experiment. The researchers must decide which ..."
Abstract - Cited by 39 (2 self) - Add to MetaCart
This article summarizes some of the issues involved and provides a brief review of the analysis tools which are available to researchers to deal with them. Any microarray experiment involves a number of distinct stages. Firstly there is the design of the experiment. The researchers must decide which genes are to be printed on the arrays, which sources of RNA are to be hybridized to the arrays and on how many arrays the hybridizations will be replicated. Secondly, after hybridization, there follows a number of data-cleaning steps or `low-level analysis' of the microarray data. The microarray images must be processed to acquire red and green foreground and background intensities for each spot. The acquired red/green ratios must be normalized to adjust for dye-bias and for any systematic variation other than that due to the differences between the RNA samples being studied. Thirdly, the normalized ratios are analyzed by various graphical and numerical means to select differentially expressed genes or to find groups of genes whose expression profiles can reliably classify the different RNA sources into meaningful groups. The sections of this article correspond roughly to the various analysis steps. The following notation will be used throughout the article. The foreground red and green intensities will be written Pp and 9p for each spot. The background intensities will be Pf and 9f . The background-corrected intensities will be P and 9 where usually P Pp Pf 0 # and 9 9p 9f 0 # . The log-differential expression ratio will be vyq # E P 9 0 for each spot. Finally, the log-intensity of the spot will be vyq 3 P9 0 , a measure of the overall brightness of the spot. (The letter E is a mnemonic for minus as vyq vyq E P 9 0 # while 3 is a mnemonic for add as #vyq vyq #...

ExpressYourself: A modular platform for processing and visualizing microarray data. Nucleic Acids Res

by Nicholas M. Luscombe, Thomas E. Royce, Paul Bertone, Nathaniel Echols, Christine E. Horak, Joseph T. Chang, Michael Snyder, Mark Gerstein , 2003
"... DNA microarrays are widely used in biological research; by analyzing differential hybridization on a single microarray slide, one can detect changes in mRNA expression levels, increases in DNA copy numbers and the location of transcription factor binding sites on a genomic scale. Having performed th ..."
Abstract - Cited by 9 (5 self) - Add to MetaCart
DNA microarrays are widely used in biological research; by analyzing differential hybridization on a single microarray slide, one can detect changes in mRNA expression levels, increases in DNA copy numbers and the location of transcription factor binding sites on a genomic scale. Having performed the experiments, the major challenge is to process large, noisy datasets in order to identify the specific array elements that are significantly differentially hybridized. This normally requires aggregating different, often incompatible programs into a multistep pipeline. Here we present ExpressYourself, a fully integrated platform for processing microarray data. In completely automated fashion, it will correct the background array signal, normalize the Cy5 and Cy3 signals, score levels of differential hybridization, combine the results of replicate experiments, filter problematic regions of the array and assess the quality of individual and replicate experiments. ExpressYourself is designed with a highly modular architecture so various types of microarray analysis algorithms can readily be incorporated as they are developed; for example, the system currently implements several normalization methods, including those that simultaneously consider signal intensity and slide location. The processed data are presented using a web-based graphical interface to facilitate comparison with the original images of the array slides. In particular, Express Yourself is able to regenerate images of the original microarray after applying various steps of processing, which greatly facilities identification of position-specific artifacts. The program is freely available for use at

Analysis of Microarray Gene Expression Data

by Wolfgang Huber, Anja Von Heydebreck, Martin Vingron - in ‘Handbook of Statistical Genetics’, 2nd edn , 2003
"... This article reviews the methods utilized in processing and analysis of gene expression data generated using DNA microarrays. This type of experiment allows to determine relative levels of mRNA abundance in a set of tissues or cell populations for thousands of genes simultaneously. Naturally, suc ..."
Abstract - Cited by 8 (2 self) - Add to MetaCart
This article reviews the methods utilized in processing and analysis of gene expression data generated using DNA microarrays. This type of experiment allows to determine relative levels of mRNA abundance in a set of tissues or cell populations for thousands of genes simultaneously. Naturally, such an experiment requires computational and statistical analysis techniques. At the outset of the processing pipeline, the computational procedures are largely determined by the technology and experimental setup that are used. Subsequently, as more reliable intensity values for genes emerge, pattern discovery methods come into play. The most striking peculiarity of this kind of data is that one usually obtains measurements for thousands of genes for only a much smaller number of conditions. This is at the root of several of the statistical questions discussed here.

Model selection and efficiency testing for normalization of cdna microarray data

by Matthias E. Futschik, Toni Crompton - Genome Biol , 2004
"... We present in this study two novel normalization schemes for cDNA microarrays. They are based on iterative local regression and optimization of model parameters by generalized cross-validation. Permutation tests assessing the efficiency of normalization demonstrated that the proposed schemes have an ..."
Abstract - Cited by 8 (0 self) - Add to MetaCart
We present in this study two novel normalization schemes for cDNA microarrays. They are based on iterative local regression and optimization of model parameters by generalized cross-validation. Permutation tests assessing the efficiency of normalization demonstrated that the proposed schemes have an improved ability to remove systematic errors and to reduce variability in microarray data. The analysis also reveals that without parameter optimization local regression is frequently insufficient to remove systematic errors in microarray data. Background Microarrays have been widely used for the study of gene expression in biological and medical research. They allow the simultaneous measurement of the expression of thousands of genes in cells. However, microarrays do not assess gene expression directly, but only indirectly by monitoring fluorescence intensities of labeled target cDNA hybridized to probes on the arrays [1]. The first step in the analysis of microarray data is, therefore, the transformation of fluorescence

Brush Effects on DNA Chips: Thermodynamics, Kinetics, and Design Guidelines

by A. Halperin, A. Buhot, E. B. Zhulina Y
"... ABSTRACT In biology experiments, oligonucleotide microarrays are contacted with a solution of long nucleic acid targets. The hybridized probes thus carry long tails. When the surface density of the oligonucleotide probes is high enough, the progress of hybridization gives rise to a polyelectrolyte b ..."
Abstract - Cited by 4 (0 self) - Add to MetaCart
ABSTRACT In biology experiments, oligonucleotide microarrays are contacted with a solution of long nucleic acid targets. The hybridized probes thus carry long tails. When the surface density of the oligonucleotide probes is high enough, the progress of hybridization gives rise to a polyelectrolyte brush due to mutual crowding of the nucleic acid tails. The free-energy penalty associated with the brush modifies both the hybridization isotherms and the rate equations: the attainable hybridization is lowered significantly as is the hybridization rate. When the equilibrium hybridization fraction, xeq, is low, the hybridization follows a Langmuir type isotherm, xeq/(1 ÿ xeq) ctK where ct is the target concentration and K is the equilibrium constant. K is smaller than its bulk value by a factor (n/N) 2/5 due to wall effects where n and N denote the number of bases in the probe and the target. At higher xeq, when the brush is formed, the leading correction is xeq/(1 ÿ xeq) ctK exp [ÿconst9(xeq 2/3 ÿ xB 2/3)] where xB corresponds to the onset of the brush regime. The denaturation rate constant in the two regimes is identical. However, the hybridization rate constant in the brush regime is lower, the leading correction being exp [ÿconst9(x 2/3 ÿ x B 2/3)].

Optimized Normalization for Antibody Microarrays and Application to Serum-Protein Profiling * □S

by Darren Hamelinck, Heping Zhou, Lin Li, Cornelius Verweij, Deborah Dillon, Ziding Feng, Jose Costa, Brian B. Haab
"... The measurements of coordinated patterns of protein abundance using antibody microarrays could be used to gain insight into disease biology and to probe the use of combinations of proteins for disease classification. The correct use and interpretation of antibody microarray data requires proper norm ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
The measurements of coordinated patterns of protein abundance using antibody microarrays could be used to gain insight into disease biology and to probe the use of combinations of proteins for disease classification. The correct use and interpretation of antibody microarray data requires proper normalization of the data, which has not yet been systematically studied. Therefore we undertook a study to determine the optimal normalization of data from antibody microarray profiling of proteins in human serum specimens. Forty-three serum samples collected from patients with pancreatic cancer and from control subjects were probed in triplicate on microarrays containing 48 different antibodies, using a direct labeling, twocolor comparative fluorescence detection format. Seven different normalization methods representing major classes of normalization for antibody microarray data were compared by their effects on reproducibility, accuracy, and trends in the data set. Normalization with ELISAdetermined concentrations of IgM resulted in the most accurate, reproducible, and reliable data. The other normalization methods were deficient in at least one of the criteria. Multiparametric classification of the samples based on the combined measurement of seven of the proteins demonstrated the potential for increased classification accuracy compared with the use of individual measurements. This study establishes reliable normalization for antibody microarray data, criteria for assessing normalization performance, and the capability of antibody microarrays for serum-protein profiling and multiparametric sample classification. Molecular & Cellular Proteomics

The Design and Analysis of Microarray Experiments: Applications in Parasitology

by David A. Morrison, John T. Ellis
"... Microarray experiments can generate enormous amounts of data, but large datasets are usually inherently complex, and the relevant information they contain can be difficult to extract. For the practicing biologist, we provide an overview of what we believe to be the most important issues that need to ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Microarray experiments can generate enormous amounts of data, but large datasets are usually inherently complex, and the relevant information they contain can be difficult to extract. For the practicing biologist, we provide an overview of what we believe to be the most important issues that need to be addressed when dealing with microarray data. In a microarray experiment we are simply trying to identify which genes are the most “interesting ” in terms of our experimental question, and these will usually be those that are either overexpressed or underexpressed (upregulated or downregulated) under the experimental conditions. Analysis of the data to find these genes involves first preprocessing of the raw data for quality control, including filtering of the data (e.g., detection of outlying values) followed by standardization of the data (i.e., making the data uniformly comparable throughout the dataset). This is followed by the formal quantitative analysis of the data, which will involve either statistical hypothesis testing or multivariate pattern recognition. Statistical hypothesis testing is the usual approach to “class comparison, ” where several experimental groups are being directly compared. The best approach to this problem is to use analysis of variance, although issues related to multiple hypothesis testing and probability estimation still need to be evaluated. Pattern recognition can involve “class prediction, ” for which a range of supervised multivariate techniques are available, or “class discovery, ” for which an even broader range of unsupervised multivariate techniques have been developed. Each technique has its own limitations, which need to be kept in mind when making a choice from among them. To put these ideas in context, we provide a detailed examination of two specific examples of the analysis of microarray data, both from parasitology, covering many of the most important points raised.

Simulation of DNA array hybridization experiments and evaluation of critical parameters during subsequent image and data analysis

by Bmc Bioinformatics, Thorsten Elge (elqelrzpd. De, Pia Aanstad (aanstaditsa. Ucsf. Edu, Christoph Wierling, Matthias Steinfath, Matthias Steinfath, Thorsten Elge, Steffen Schulze-kremer, Pia Aanstad, Matthew Clark, Hans Lehrach, Hans Lehrach, Ralf Herwig, Ralf Herwig , 2002
"... Background: Gene expression analyses based on complex hybridization measurements have increased rapidly in recent years and have given rise to a huge amount of bioinformatic tools such as image analyses and cluster analyses. ..."
Abstract - Add to MetaCart
Background: Gene expression analyses based on complex hybridization measurements have increased rapidly in recent years and have given rise to a huge amount of bioinformatic tools such as image analyses and cluster analyses.

BioMed Central

by Bmc Bioinformatics Open, Dankyu Yoon, Sung-gon Yi, Ju-han Kim, Taesung Park
"... Background: In the microarray experiment, many undesirable systematic variations are commonly observed. Normalization is the process of removing such variation that affects the measured gene expression levels. Normalization plays an important role in the earlier stage of microarray data analysis. ..."
Abstract - Add to MetaCart
Background: In the microarray experiment, many undesirable systematic variations are commonly observed. Normalization is the process of removing such variation that affects the measured gene expression levels. Normalization plays an important role in the earlier stage of microarray data analysis. The subsequent analysis results are highly dependent on normalization.
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University