• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Semi-supervised graph clustering: a kernel approach (2008)

Cached

  • Download as a PDF

Download Links

  • [www.cs.utexas.edu]
  • [www.cs.utexas.edu]
  • [www.cs.utexas.edu]
  • [www.cs.utexas.edu]
  • [www.cs.utexas.edu]
  • [www.cs.utexas.edu]
  • [www.cs.utexas.edu]
  • [www.cs.utexas.edu]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Brian Kulis , Sugato Basu , Inderjit Dhillon , Raymond Mooney
Citations:24 - 1 self
  • Summary
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@MISC{Kulis08semi-supervisedgraph,
    author = {Brian Kulis and Sugato Basu and Inderjit Dhillon and Raymond Mooney},
    title = { Semi-supervised graph clustering: a kernel approach},
    year = {2008}
}

Bookmark

citeulike Connotea Bibsonomy Del.icio.us Digg Reddit

OpenURL

 

Abstract

Semi-supervised clustering algorithms aim to improve clustering results using limited supervision. The supervision is generally given as pairwise constraints; such constraints are natural for graphs, yet most semi-supervised clustering algorithms are designed for data represented as vectors. In this paper, we unify vector-based and graph-based approaches. We first show that a recently-proposed objective function for semi-supervised clustering based on Hidden Markov Random Fields, with squared Euclidean distance and a certain class of constraint penalty functions, can be expressed as a special case of the weighted kernel k-means objective (Dhillon et al., in Proceedings of the 10th International Conference on Knowledge Discovery and Data Mining, 2004a). A recent theoretical connection between weighted kernel k-means and several graph clustering objectives enables us to perform semi-supervised clustering of data given either as vectors or as a graph. For graph data, this result leads to algorithms for optimizing several new semi-supervised graph clustering objectives. For vector data, the kernel approach also enables us to find clusters with non-linear boundaries in the input data space. Furthermore, we show that recent work on spectral learning (Kamvar et al., in Proceedings of the 17th International Joint Conference on Artificial Intelligence, 2003) may be viewed as a special case of our formulation. We empirically show that our algorithm is able to outperform current state-of-the-art semi-supervised algorithms on both vector-based and graph-based data sets.

Citations

6517 Elements of information theory - Cover, Thomas - 1991
3339 Pattern Classification and Scene Analysis - Duda, Hart - 1973
1824 Normalized cuts and image segmentation - Shi, Malik - 2000
742 An introduction to Support Vector Machines - Cristianini, Shawe-Taylor - 2000
357 Distance metric learning, with application to clustering with side-information - Xing, Ng, et al. - 2003
250 Schrodl S: Constrained k-means clustering with background knowledge - Wagstaff, Cardie, et al.
224 The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features - Grauman, Darrell - 2005
171 Semi-Supervised Learning - Chapelle, Schlkopf, et al. - 2006
158 Correlation clustering - Bansal, Blum, et al. - 2002
134 A probabilistic framework for semi-supervised clustering - Basu, Bilenko, et al. - 2004
131 Approximation algorithms for classification problems with pairwise relationships: Metric labeling and markov random fields - Kleinberg, Tardos - 1999
130 J: A Random Walks View of Spectral Segmentation - Meila, Shi
125 Mooney RJ: Integrating constraints and metric learning in semi-supervised clustering - Bilenko, Basu
122 Impact of Similarity Measures on Web-Page Clustering - Strehl, Ghosh, et al. - 2000
120 Multiclass spectral clustering - Yu, Shi - 2003
119 From Instance-Level Constraints to SpaceLevel Constraints: Making the Most of Prior Knowledge in Data Clustering - Klein, Kamvar, et al. - 2002
98 Mooney R: Semi-supervised clustering by seeding - Basu, Banerjee
94 D.: Learning distance functions using equivalence relations - Bar-Hillel, Hertz, et al. - 2003
80 KEGG: Kyoto Encyclopedia of Genes and Genomes - Ogata, Goto, et al. - 1999
79 A probabilistic functional network of yeast genes - Lee, Date, et al. - 2004
77 Kernel k-means, spectral clustering and normalized cuts - Dhillon, Guan, et al. - 2004
77 S: Clustering based on conditional distributions in an auxiliary space - Sinkkonen, Kaski
69 Clustering with qualitative information - Charikar, Guruswami, et al. - 2005
60 Active semi-supervision for pairwise constrained clustering - Basu, Banerjee, et al. - 2004
50 C.D.: Spectral learning - Kamvar, Klein, et al. - 2003
50 Semi-Supervised Clustering Using Genetic Algorithms - Demiriz, Bennett, et al. - 1999
47 Weighted graph cuts without eigenvectors: a multilevel approach - Dhillon, Guan, et al.
40 A unified view of kernel k-means, spectral clustering, and graph partitioning - Dhillon, Guan, et al. - 2005
40 Clustering with constraints: Feasibility issues and the k-means algorithm - Davidson, Ravi - 2005
39 Correlation clustering with partial information - Demaine, Immorlica - 2003
26 Semi-supervised learning with Penalized Probabilistic Clustering - Lu, Leen
21 Locally linear metric adaptation for semi-supervised clustering - Chang, Yeung - 2004
20 A discriminative learning framework with pairwise constraints for video object classification - Yan, Zhang, et al.
16 Agglomerative hierarchical clustering with constraints: Theoretical and empirical results - Davidson, Ester, et al. - 2005
15 Learning with constrained and unlabelled data - Lange, Law, et al. - 2005
14 Spectral k-way ratio cut partitioning - Chan, Schlag, et al.
12 Model-based clustering with probabilistic constraints - Law, Topchy, et al. - 2005
10 Fast low-rank semidefinite programming for embedding and clustering - Kulis, Surendran, et al. - 2007
2 Efficiently learning the metric using sideinformation - Bie, Momma, et al.
2 A comparison of inference techniques for semi-supervised clustering with hidden Markov random fields - Bilenko, Basu - 2004
1 Kernels and regularization on computational graphs - Smola, Kondor - 2003
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University