MetaCart Sign in to MyCiteSeerX

Include Citations | Advanced Search | Help

Disambiguated Search | Include Citations | Advanced Search | Help

Clustering with Instance-level Constraints (2000) [76 citations — 5 self]

by Kiri Wagstaff ,  Claire Cardie
Proceedings of the Seventeenth International Conference on Machine Learning
Add To MetaCart

Abstract:

Clustering algorithms conduct a search through the space of possible organizations of a data set. In this paper, we propose two types of instance-level clustering constraints - must-link and cannot-link constraints - and show how they can be incorporated into a clustering algorithm to aid that search. For three of the four data sets tested, our results indicate that the incorporation of surprisingly few such constraints can increase clustering accuracy while decreasing runtime. We also investigate the relative effects of each type of constraint and find that the type that contributes most to accuracy improvements depends on the behavior of the clustering algorithm without constraints.

Citations

4735 Maximum Likelihood from incomplete data via the EM algorithm – Dempster, Laird, et al. - 1977
2227 UCI repository of machine learning databases – Blake, Merz
523 Knowledge Acquisition via Incremental Concept Formation – Fisher - 1987
424 Dithered Quantizers – Gray, Stockham - 1993
319 The Swiss-Prot Protein Sequence Data Bank User Manual. Release 35 – Bairoch - 1997
72 Information, uncertainty, and the utility of categories – Gluck, Corter - 1985
49 Noun phrase coreference as clustering – Cardie, Wagstaff - 1999
21 Constraints on tree structure in concept formation – McKusick - 1991
18 A generic framework for constraint-directed search and scheduling – Beck, Fox - 1998
16 Integrating Declarative Knowledge in Hierarchical Clustering Tasks – Talavera, Bejar - 1999
14 Autoclass: A bayesian classi system – Cheeseman, Kelly, et al. - 1988
13 A methodology for processing problem constraints in genetic programming – Janikow - 1996
8 Iterative optimization and simpli cation of hierarchical clusterings – Fisher - 1996
7 Using background knowledge in multilayer perceptron learning – Lampinen, Selonen - 1997
4 Integrity constraints and interactive concept-learning – Raedt, Bruynooghe, et al. - 1991
1 Ordering eects in clustering – Fisher, Xu, et al. - 1992