• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

A variancestabilizing transformation for gene-expression microarray data. Bioinformatics 18 Suppl 1 (2002)

by B P Durbin, J S Hardin, D M Hawkins, D M Rocke
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 144
Next 10 →

A Model Based Background Adjustment for Oligonucleotide Expression Arrays.

by Z Wu, Irizarry RA, R Gentleman, F Martinez-Murillo, F Spencer - Journal of the American Statistical Association , 2004
"... ..."
Abstract - Cited by 207 (4 self) - Add to MetaCart
Abstract not found

Model-based Variance-stabilizing Transformation for Illumina Microarray Data’, Nucleic Acids Res

by Simon M. Lin, Pan Du, Wolfgang Huber, Warren A. Kibbe , 2008
"... doi:10.1093/nar/gkm1075 ..."
Abstract - Cited by 101 (9 self) - Add to MetaCart
doi:10.1093/nar/gkm1075
(Show Context)

Citation Context

...s the measurement range, which is described as ‘heteroskedasticity’ in statistics, imposes a serious challenge when applying canonical linear models or analysis of variance (ANOVA) to microarray data =-=(6)-=-. So that these well-established and well-understood statistical models can be applied to microarray data, a data transformation strategy is usually applied to abrogate or at least reduce the heterosk...

Stochastic Models Inspired by Hybridization Theory for Short Oligonucleotide Arrays (Extended Abstract)

by Zhijin Wu, Rafael A. Irizarry - J. Comput. Biol , 2004
"... Zhijin Wu Johns Hopkins Bloomberg School of Public Health 615 North Wolfe Street zwu@jhsph.edu Rafael A. Irizarry Johns Hopkins Bloomberg School of Public Health 615 North Wolfe Street rafa@jhu.edu ABSTRACT High density oligonucleotide expression arrays are a widely used tool for the measureme ..."
Abstract - Cited by 80 (4 self) - Add to MetaCart
Zhijin Wu Johns Hopkins Bloomberg School of Public Health 615 North Wolfe Street zwu@jhsph.edu Rafael A. Irizarry Johns Hopkins Bloomberg School of Public Health 615 North Wolfe Street rafa@jhu.edu ABSTRACT High density oligonucleotide expression arrays are a widely used tool for the measurement of gene expression on a large scale. A#ymetrix GeneChip arrays appear to dominate this market. These arrays use short oligonucleotides to probe for genes in an RNA sample. Due to optical noise, nonspecific hybridization, probe-specific e#ects, and measurement error, ad-hoc measures of expression, that summarize probe intensities, can lead to imprecise and inaccurate results. Various researchers have demonstrated that expression measures based on simple statistical models can provide great improvements over the ad-hoc procedure o#ered by A#ymetrix. Recently, physical models based on molecular hybridization theory, have been proposed as useful tools for prediction of, for example, non-specific hybridization. These physical models show great potential in terms of improving existing expression measures. In this paper we suggest that the system producing the measured intensities is too complex to be fully described with these relatively simple physical models and we propose empirically motivated stochastic models that compliment the above mentioned molecular hybridization theory to provide a comprehensive description of the data. We discuss how the proposed model can be used to obtain improved measures of expression useful for the data analysts.
(Show Context)

Citation Context

...a multiplicative measurement error model [7, 19, 15, 20] for microarray data. A slightly more complicated additive background multiplicative measurement error model has been proposed by, for example, =-=[11, 6, 8]-=-. In Figure 1a we see observed P M log (base 2) intensities from the spike-in data plotted against their nominal log (base 2) concentration. The solid line shows the median value for each concentratio...

Rosetta error model for gene expression analysis

by Lee Weng, Hongyue Dai, Yihui Zhan, Yudong He, Sergey B. Stepaniants, E. Bassett, John Quackenbush - Bioinformatics , 2006
"... The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and ..."
Abstract - Cited by 61 (0 self) - Add to MetaCart
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions@oxfordjournals.org

Detecting differentially expressed genes in microarrays using Bayesian model selection

by Hemant Ishwaran, J. Sunil Rao - Journal of the American Statistical Association
"... DNA microarrays open up a broad new horizon for investigators interested in studying the genetic determinants of disease. The high throughput nature of these arrays, where differential expression for thousands of genes can be measured simultaneously, creates an enormous wealth of information, but al ..."
Abstract - Cited by 57 (9 self) - Add to MetaCart
DNA microarrays open up a broad new horizon for investigators interested in studying the genetic determinants of disease. The high throughput nature of these arrays, where differential expression for thousands of genes can be measured simultaneously, creates an enormous wealth of information, but also poses a challenge for data analysis because of the large multiple testing problem involved. The solution has generally been to focus on optimizing false-discovery rates while sacri � cing power. The drawback of this approach is that more subtle expression differences will be missed that might give investigators more insight into the genetic environment necessary for a disease process to take hold. We introduce a new method for detecting differentially expressed genes based on a high-dimensional model selection technique, Bayesian ANOVA for microarrays (BAM), which strikes a balance between false rejections and false nonrejections. The basis of the new approach involves a weighted average of generalized ridge regression estimates that provides the bene � ts of using shrinkage estimation combined with model averaging. A simple graphical tool based on the amount of shrinkage is developed to visualize the trade-off between low false-discovery rates and � nding more genes. Simulations are used to illustrate BAM’s performance, and the method is applied to a large database of colon cancer gene expression data. Our working hypothesis in the colon cancer analysis is that large differential expressions may not be the only ones contributing to metastasis—in fact, moderate changes in expression of genes may be involved in modifying the genetic environment to a suf � cient extent for metastasis to occur. A functional biological analysis of gene effects found by BAM, but not other false-discovery-based approaches, lends support to this hypothesis. KEY WORDS: Bayesian analysis of variance for microarrays; False discovery rate; False nondiscovery rate; Heteroscedasticity; Ridge

Transformation and normalization of oligonucleotide microarray data

by Sue C. Geller, Jeff P. Gregg, Paul Hagerman, David M. Rocke - Bioinformatics , 2003
"... ∗ To whom correspondence should be addressed ..."
Abstract - Cited by 33 (4 self) - Add to MetaCart
∗ To whom correspondence should be addressed

Estimation of transformation parameters for microarray data

by Blythe Durbin, David M. Rocke - Bioinformatics , 2003
"... a family of transformations (the generalized-log family) which stabilizes the variance of microarray data up to the first order. We introduce a method for estimating the transformation parameter in tandem with a linear model based on the procedure outlined in Box and Cox (1964). We also discuss mean ..."
Abstract - Cited by 31 (6 self) - Add to MetaCart
a family of transformations (the generalized-log family) which stabilizes the variance of microarray data up to the first order. We introduce a method for estimating the transformation parameter in tandem with a linear model based on the procedure outlined in Box and Cox (1964). We also discuss means of finding transformations within the generalized-log family which are optimal under other criteria, such as minimum residual skewness and minimum mean-variance dependency. Availability: R and Matlab code and test data are available from the authors on request. Contact:
(Show Context)

Citation Context

...nsformation converges to ln(z)+ ln(2) for large z (equivalent to a log transformation, as the additive constant does not affect the strength of the transformation) , and is approximately linear at 0 (=-=Durbin et al., 2002-=-). The inverse transformation is h −1 λ (u) = (eu − λe −u )/2. Both hλ and its inverse are monotonic functions, defined for all values of z and u, with derivatives of all orders. When transforming dat...

Spike and slab gene selection for multigroup microarray data

by Hemant Ishwaran, J. Sunil Rao - J. Amer. Statist. Assoc , 2005
"... DNA microarrays can provide insight into genetic changes that characterize different stages of a disease process. Accurate identification of these changes has significant therapeutic and diagnostic implications. Statistical analysis for multistage (multigroup) data is challenging, however. ANOVA-bas ..."
Abstract - Cited by 30 (7 self) - Add to MetaCart
DNA microarrays can provide insight into genetic changes that characterize different stages of a disease process. Accurate identification of these changes has significant therapeutic and diagnostic implications. Statistical analysis for multistage (multigroup) data is challenging, however. ANOVA-based extensions of two-sample Z-tests, a popular method for detecting differentially expressed genes in two groups, do not work well in multigroup settings. False detection rates are high because of variability of the ordinary least squares estimators and because of regression to the mean induced by correlated parameter estimates. We develop a Bayesian rescaled spike and slab hierarchical model specifically designed for the multigroup gene detection problem. Data preprocessing steps are introduced to deal with unique features of microarray data and to enhance selection performance. We show theoretically that spike and slab models naturally encourage sparse solutions through a process called selective shrinkage. This translates into oracle-like gene selection risk performance compared with ordinary least squares estimates. The methodology is illustrated on a large microarray repository of samples from different clinical stages of metastatic colon cancer. Through a functional analysis of selected genes, we show that spike and slab models identify important biological signals while minimizing biologically implausible false detections.

Integrated analysis of transcriptomic and proteomic data of Desulfovibrio vulgaris: zero-inflated Poisson regression models to predict abundance of undetected proteins.

by Lei Nie , ; Gang , Wu , David E Culley , ; Johannes , C M Scholten , Weiwen Zhang , Lei, Wu, Gang, Culley David E Nie , Scholten Johannes C M Zhang , Weiwen - Bioinformatics. , 2006
"... ..."
Abstract - Cited by 28 (4 self) - Add to MetaCart
Abstract not found

VarMixt: efficient variance modelling for the differential analysis of replicated gene expression data

by Paul Delmar, Grande Voie Des Vignes, Chatenay Malabry France, Jean Jacques Daudin, Stéphane Robin - Bioinformatics , 2005
"... replicated gene expression data. ..."
Abstract - Cited by 27 (1 self) - Add to MetaCart
replicated gene expression data.
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University