CHAMELEON: A Hierarchical Clustering Algorithm Using Dynamic Modeling (1999) [145 citations — 14 self]

Abstract:

Clustering in data mining is a discovery process that groups a set of data such that the intracluster similarity is maximized and the intercluster similarity is minimized. Existing clustering algorithms, such as K-means, PAM, CLARANS, DBSCAN, CURE, and ROCK, are designed to find clusters that fit some static models. These algorithms can break down if the choice of parameters in the static model is incorrect with respect to the data set being clustered, or if the model is not adequate to capture the characteristics of clusters. Furthermore, most of these algorithms break down when the data consists of clusters that are of diverse shapes, densities, and sizes. In this paper, we present a novel hierarchical clustering algorithm called CHAMELEON that measures the similarity of two clusters based on a dynamic model. In the clustering process, two clusters are merged only if the inter-connectivity and closeness (proximity) between two clusters are high relative to the internal inter-connectivit...
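
The merge rule described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it covers only the inter-connectivity half of the criterion (closeness is omitted), and it assumes a simple stand-in for "internal inter-connectivity", namely the total edge weight inside a cluster. The graph representation, the function names, and the `threshold` parameter are all hypothetical choices made for illustration.

```python
# Weighted similarity graph as {(u, v): weight}; an assumed toy representation.
graph = {
    (0, 1): 1.0, (1, 2): 1.0, (0, 2): 1.0,   # tightly connected group
    (3, 4): 1.0, (4, 5): 1.0, (3, 5): 1.0,   # second tightly connected group
    (2, 3): 0.2,                             # weak bridge between the groups
}

def cross_weight(graph, a, b):
    """Total weight of edges with one endpoint in cluster a and one in b."""
    return sum(w for (u, v), w in graph.items()
               if (u in a and v in b) or (u in b and v in a))

def internal_weight(graph, c):
    """Total weight of edges with both endpoints in cluster c
    (assumed stand-in for internal inter-connectivity)."""
    return sum(w for (u, v), w in graph.items() if u in c and v in c)

def relative_interconnectivity(graph, a, b):
    """Cross-cluster weight divided by the mean internal weight of a and b."""
    internal = (internal_weight(graph, a) + internal_weight(graph, b)) / 2
    if internal == 0:
        return 0.0
    return cross_weight(graph, a, b) / internal

def should_merge(graph, a, b, threshold=0.5):
    """Merge only if connectivity between the clusters is high relative
    to their internal connectivity (threshold is a hypothetical knob)."""
    return relative_interconnectivity(graph, a, b) >= threshold

left, right = {0, 1, 2}, {3, 4, 5}
print(should_merge(graph, left, right))   # weak bridge relative to dense interiors
print(should_merge(graph, {0, 1}, {2}))   # strong links relative to sparse interiors
```

In this toy graph the two triangles are joined only by a weak edge, so the relative measure stays low and they are kept apart, while node 2 is strongly tied to {0, 1} relative to their internal weight and would be merged.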

Citations

1478 Algorithms for Clustering Data – Jain, Dubes - 1988
1014 The Design and Analysis of Spatial Data Structures – Samet - 1989
728 Finding Groups in Data: An Introduction to Cluster Analysis – Kaufman, Rousseeuw - 1990
570 A Density-Based Algorithm for Discovering Clusters – Ester, Kriegel, et al. - 1996
442 Efficient and effective clustering methods for spatial data mining – Ng, Han - 1994
415 An algorithm for finding best matches in logarithmic expected time – Friedman, Bentley, et al. - 1977
362 Cure: an efficient clustering algorithm for large databases – Guha, Rastogi, et al. - 2001
361 Bayesian classification (AutoClass): Theory and results – Cheeseman, Stutz - 1995
315 Parallel multilevel k-way partitioning scheme for irregular graphs – Karypis, Kumar - 1996
280 Data mining: An overview from database perspective – Chen, Han, et al. - 1996
209 ROCK: A Robust Clustering Algorithm for Categorical Attributes – Guha, Rastogi, et al.
178 Scaling Clustering Algorithms to Large Databases – Bradley, Fayyad, et al. - 1998
168 Cost model for nearest neighbor search in high-dimensional data space – Berchtold, Böhm, et al. - 1997
153 The pyramid-technique: towards breaking the curse of dimensionality – Berchtold, Böhm, et al. - 1998
109 The ISPD-98 Circuit Benchmark Suite – Alpert
80 Multilevel k-way Hypergraph Partitioning – Karypis, Kumar - 1999
79 Clustering based on association rule hypergraphs – Han, Karypis, et al. - 1997
72 Clustering using a similarity measure based on shared nearest neighbors – Jarvis, Patrick - 1973
70 Document categorization and query generation on the world wide web using WebACE – Boley, Gini, et al. - 1999
54 clustering for web document categorization. Decision Support Systems (accepted for publication – Boley, Gini, et al. - 1999
50 A fast and high quality multilevel scheme for partitioning irregular graphs – Karypis, Kumar - 1999
48 A distribution-based clustering algorithm for mining in large spatial databases – Xu, Ester, et al. - 1998
46 Hypergraph Based Clustering in High-Dimensional Data Sets: A Summary of Results – Han - 1998
36 Nearest neighbor clutter removal for estimating features in spatial point processes – Byers, Raftery - 1998
36 Clustering large datasets in arbitrary metric spaces – Ganti, Ramakrishnan, et al. - 1999
36 DBMS research at a crossroads: The vienna update – Stonebraker, Agrawal, et al. - 1993
35 Agglomerative clustering using the concept of mutual nearest neighborhood – Gowda, Krishna - 1978
23 Mega-classification: Discovering motifs in massive datastreams – Harris, Hunter, et al. - 1992
19 Birch: an efficient data clustering method for large databases – Zhang, Ramakrishnan, et al. - 1996
18 METIS 4.0: Unstructured graph partitioning and sparse matrix ordering system – Karypis, Kumar - 1998
14 hMETIS 1.5: A hypergraph partitioning package – Karypis, Kumar - 1998
14 Clustering and Classification – Arabie, Hubert, et al. - 1996
8 Implementation and testing of an automated EST processing and analysis system – Shoop, Chi, et al. - 1995
4 Arabidopsis thaliana expressed sequence tags: Generation, analysis and dissemination – Newman, Retzel, et al. - 1995
2 Some fundamental concepts and synthesis procedures for pattern recognition preprocessors – Ball, Hall - 1964