Citations
1766 | Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
- Golub, Slonim, et al.
- 1999
(Show Context)
Citation Context ... dataset have been biologically characterized and assigned to different cell cycle phases. Leukemia: Approximately 7000 genes as objects, consisting of data for 38 ALL and AML patients as attributes (=-=Golub et al., 1999-=-; Famili and Ouyang, 2003). The objective of the original research was to identify the most informative genes for the purpose of disease modeling and more accurate classification of ALL/AML patients. ... |
765 |
Interpreting pattern of gene expression with selforganizing maps: methods and application to hematopoietic differentiation
- Tamayo, Slonim, et al.
- 1999
(Show Context)
Citation Context ...7 time points listed by Cho et al. (1998), from which we selected 2321 genes based on the largest variance in their expression. One abnormal time point was also removed from the dataset (suggested by =-=Tamayo et al., 1999-=-). These data have been used extensively in the literature for clustering and unsupervised pattern recognition. A large number of genes contained in this dataset have been biologically characterized a... |
523 |
A genome-wide transcriptional analysis of the mitotic cell cycle
- Cho, Campbell, et al.
- 1998
(Show Context)
Citation Context ...ics.oxfordjournals.org/ D ow nloaded from Evaluation and optimization of clustering in gene expression data analysis Fig. 9. The intensity spectrum plot of biologically characterized genes (listed by =-=Cho et al., 1998-=-) that are found in our meaningful clusters. (ii) Leukemia data: Figure 10 illustrates mean expression levels of the 13 clusters corresponding to the clustering results with the number of clusters at ... |
507 |
Silhouettes: A graphical aid to the interpretation and validation of cluster analysis
- Rousseeuw
- 1987
(Show Context)
Citation Context ... uses the complete dataset to determine the cluster quality (the original information is kept intact). Other methods use only some of the factors that determine quality, such as the silhouette index (=-=Rousseeuw, 1987-=-); or use only part of the complete dataset such as the re-sampling validation method (Dudoit and Fridlyand, 2002; Ben-Hur et al., 2002). In the following sections, we first describe related work and ... |
234 | S.: Applications of resampling methods to estimate the number of clusters and to improve the accuracy of clustering method
- Fridlyand, Dudoit
- 2001
(Show Context)
Citation Context ... Other methods use only some of the factors that determine quality, such as the silhouette index (Rousseeuw, 1987); or use only part of the complete dataset such as the re-sampling validation method (=-=Dudoit and Fridlyand, 2002-=-; Ben-Hur et al., 2002). In the following sections, we first describe related work and compare it with techniques used in our studies. We then introduce our stability-based technique and present the r... |
148 |
A stability based method for discovering structure in clustered data
- Ben-Hur, Elisseeff, et al.
- 2002
(Show Context)
Citation Context ... of the factors that determine quality, such as the silhouette index (Rousseeuw, 1987); or use only part of the complete dataset such as the re-sampling validation method (Dudoit and Fridlyand, 2002; =-=Ben-Hur et al., 2002-=-). In the following sections, we first describe related work and compare it with techniques used in our studies. We then introduce our stability-based technique and present the results of applying thi... |
127 | Validating clustering for gene expression data - Yeung, Haynor, et al. - 2001 |
71 | Analysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters - Lukashin, Fuchs - 2001 |
53 | A cluster validity framework for genome expression data - Azuaje - 2002 |
53 | Cluster stability scores for microarray data in cancer studies - Smolkin, Gosh |
50 | Mining for putative regulatory elements in the yeast genome using gene expression data - Vilo, Brazma, et al. - 2000 |
43 | Stability-based model selection. In - Shah, Lange, et al. - 2003 |
33 | A literature-based method for assessing the functional coherence of a gene group - Raychaudhuri, Altman - 2003 |
22 | Assessing reliability of gene clusters from gene expression data - Zhang, Zhao |
9 |
Data mining: understanding data and disease modeling
- Famili, Ouyang
- 2003
(Show Context)
Citation Context ...iologically characterized and assigned to different cell cycle phases. Leukemia: Approximately 7000 genes as objects, consisting of data for 38 ALL and AML patients as attributes (Golub et al., 1999; =-=Famili and Ouyang, 2003-=-). The objective of the original research was to identify the most informative genes for the purpose of disease modeling and more accurate classification of ALL/AML patients. The most informative gene... |
8 | Comparisons and validation of clustering techniques for microarray gene expression data - Datta, Datta - 2003 |
6 | Stability-based cluster analysis applied to microarray data - Giurcăneanu, Tabus, et al. - 2003 |
4 |
Knowledge discovery in Hepatitis C Virus transgenic mice, submitted to the Evaluation and optimization of clustering in gene expression data analysis
- Famili
- 2003
(Show Context)
Citation Context ...nts. The most informative genes exhibit expression patterns strongly correlated with the class distinction (Golub et al., 1999). Hepatitis C virus: Containing 5756 genes for six repeated experiments (=-=Famili et al., 2003-=-) related to Hepatitis C transgenic mice. These data were originally used for gene identification. The expression level of the most informative genes should exhibit a large deviation between experimen... |
4 | Analysis of transforming growth factor (TGF)- modulated genes involved in the epithelial to mesenchymal transdifferentiation of murine mammary epithelial cells, Poster presentation at ASCR - O’Connor-McCourt - 2003 |
2 |
Cluster Analysis for Social Scientists
- Fiske
- 1983
(Show Context)
Citation Context ...pression data (the optimal number of clusters). The most common cluster validation techniques are based on one of the following three principles: external criteria, internal criteria and replication (=-=Fiske, 1983-=-). In most cases, external information is not known, and so internal criteria and replication techniques are more often used for cluster validation. Azuaje (2002) evaluated the validation of three int... |
2 |
Data mining of gene expression changes
- Walker, Smith, et al.
- 2004
(Show Context)
Citation Context ...f clustering experiments were performed. These all used K-means with a random seed selection and Euclidean as distance measure. All experiments were performed using our BioMiner data mining software (=-=Walker et al., 2004-=-). Table 1 contains the summary of these experiments. The distance measures listed in this table were selected from amongst 21 different distance measures available in this software. They were selecte... |
1 | at Pennsylvania State U niversity on M arch 5, 2016 http://bioinform atics.oxfordjournals.org/ D ow nloaded from Evaluation and optimization of clustering in gene expression data analysis - Eisen, Spellman, et al. - 1998 |