Clustering in data mining is a discovery process that groups a set of data such that the intracluster similarity is maximized and the intercluster similarity is minimized. Existing clustering algorithms, such as K-means, PAM, CLARANS, DBSCAN, CURE, and ROCK are designed to find clusters that fit some static models. These algorithms can breakdown if the choice of parameters in the static model is incorrect with respect to the data set being clustered, or if the model is not adequate to capture the characteristics of clusters. Furthermore, most of these algorithms breakdown when the data consists of clusters that are of diverse shapes, densities, and sizes. In this paper, we present a novel hierarchical clustering algorithm called CHAMELEON that measures the similarity of two clusters based on a dynamic model. In the clustering process, two clusters are merged only if the inter-connectivity and closeness (proximity) between two clusters are high relative to the internal inter-connectivit...
|
1478
|
Algorithms for Clustering Data
– Jain, Dubes
- 1988
|
|
1014
|
The Design and Analysis of Spatial Data Structures
– Samet
- 1989
|
|
728
|
Finding Groups in Data: An Introduction to Cluster Analysis
– Kaufman, Rousseeuw
- 1990
|
|
570
|
A Density-Based Algorithm for Discovering Clusters
– Ester, Kriegel, et al.
- 1996
|
|
442
|
Efficient and effective clustering methods for spatial data mining
– Ng, Han
- 1994
|
|
415
|
An algorithm for finding best matches in logarithmic expected time
– Friedman, Bentley, et al.
- 1977
|
|
362
|
Cure: an efficient clustering algorithm for large databases
– Guha, Rastogi, et al.
- 2001
|
|
361
|
Bayesian classification (AutoClass): Theory and results
– Cheeseman, Stutz
- 1995
|
|
315
|
Parallel multilevel k-way partitioning scheme for irregular graphs
– Karypis, Kumar
- 1996
|
|
280
|
Data mining: An overview from database perspective
– Chen, Han, et al.
- 1996
|
|
209
|
ROCK: A Robust Clustering Algorithm for Categorical Attributes
– Guha, Rastogi, et al.
|
|
178
|
Scaling Clustering Algorithms to Large Databases
– Bradley, Fayyad, et al.
- 1998
|
|
168
|
Cost model for nearest neighbor search in highdimensional data space
– Berchtold, Bohm, et al.
- 1997
|
|
153
|
The pyramid-technique: towards breaking the curse of dimensionality
– Berchtold, Böhm, et al.
- 1998
|
|
109
|
The ISPD-98 Circuit Benchmark Suite
– Alpert
|
|
80
|
Multilevel k-way Hypergraph Partitioning
– Karypis, Kumar
- 1999
|
|
79
|
Clustering based on association rule hypergraphs
– HAN, KARYPIS, et al.
- 1997
|
|
72
|
Clustering using a similarity measure based on shared nearest neighbors
– JARVIS, PATRICK
- 1973
|
|
70
|
Document categorization and query generation on the world wide web using WebACE
– Boley, Gini, et al.
- 1999
|
|
54
|
clustering for web document categorization. Decision Support Systems (accepted for publication
– Boley, Gini, et al.
- 1999
|
|
50
|
A fast and highly quality multilevel scheme for partitioning irregular graphs
– KARYPIS, KUMAR
- 1999
|
|
48
|
A distribution-based clustering algorithm for mining in large spatial databases
– XU, ESTER, et al.
- 1998
|
|
46
|
Hypergraph Based Clustering in High-Dimensional Data Sets: A Summary of Results
– Han
- 1998
|
|
36
|
Nearest neighbor clutter removal for estimating features in spatial point processes
– Byers, Raftery
- 1998
|
|
36
|
Clustering large datasets in arbitrary metric spaces
– GANTI, RAMAKRISHNAN, et al.
- 1999
|
|
36
|
DBMS research at a crossroads: The vienna update
– Stonebraker, Agrawal, et al.
- 1993
|
|
35
|
Agglomerative clustering using the concept of mutual nearest neighborhood
– GOWDA, KRISHNA
- 1978
|
|
23
|
Mega-classification: Discovering motifs in massive datastreams
– Harris, Hunter, et al.
- 1992
|
|
19
|
Birch: an efficient data clustering method for large databases
– Zhang, Ramakrishnan, et al.
- 1996
|
|
18
|
METIS 4.0: Unstructured graph partitioning and sparse matrix ordering system
– Karypis, Kumar
- 1998
|
|
14
|
hMETIS 1.5: A hypergraph partitioning package
– Karypis, Kumar
- 1998
|
|
14
|
Clustering and Classification
– Arabie, Hubert, et al.
- 1996
|
|
8
|
Implementation and testing of an automated EST processing and analysis system
– Shoop, Chi, et al.
- 1995
|
|
4
|
Arabidopsis thaliana expressed sequence tags: Generation, analysis and dissemination
– Newman, Retzel, et al.
- 1995
|
|
2
|
Some fundamental concepts and sysnthesis procedures for pattern recognition preprocessors
– Ball, Hall
- 1964
|