Human performance on clustering web pages: a preliminary study (1998)

by Sofus A. Macskassy , Arunava Banerjee , Brian D. Davison , Haym Hirsh
Venue:In Proceedings of The Fourth International Conference on Knowledge Discovery and Data Mining (KDD-98
Citations:26 - 1 self

Documents Related by Co-Citation

622 Scatter/Gather: A Cluster-based Approach to Browsing Large Document Collections – Douglass R. Cutting, David R. Karger, Jan O. Pedersen, John W. Tukey - 1992
2703 Indexing by latent semantic analysis – Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, Richard Harshman - 1990
3234 The Anatomy of a Large-Scale Hypertextual Web Search Engine – Sergey Brin, Lawrence Page - 1998
2702 Authoritative Sources in a Hyperlinked Environment – Jon M. Kleinberg - 1999
101 Fast and Intuitive Clustering of Web Documents – Oren Zamir, Oren Etzioni, Omid Madani, Richard M. Karp - 1997
380 Reexamining the Cluster Hypothesis: Scatter/Gather on Retrieval Results – Marti A. Hearst, Jan O. Pedersen - 1996
1795 An algorithm for suffix stripping – M F Porter - 1980
329 Web Document Clustering: A Feasibility Demonstration – Oren Zamir, Oren Etzioni - 1998
8058 Maximum likelihood from incomplete data via the EM algorithm – A. P. Dempster, N. M. Laird, D. B. Rubin - 1977
420 A comparison of document clustering techniques – Michael Steinbach, George Karypis, Vipin Kumar - 2000
528 Using Linear Algebra for Intelligent Information Retrieval – Susan T. Dumais, Michael Berry, Michael W. Berry, Susan, T. Dumais - 1995
272 TileBars: Visualization of Term Distribution Information in Full Text Information Access – Marti A. Hearst - 1995
168 A technique for measuring the relative size and overlap of public web search engines – K Bharat, A Broder - 1998
277 Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text – Soumen Chakrabarti , Byron Dom, Prabhakar Raghavan, Sridhar Rajagopalan, David Gibson, Jon Kleinberg - 1998
131 Clustering algorithms – E Rasmussen - 1992
1519 Term-weighting approaches in automatic text retrieval – Gerard Salton, Christopher Buckley - 1988
3111 Introduction to Modern Information Retrieval – G Salton, M J McGill - 1986
21 Ephemeral Document Clustering for Web Applications – YoĆ«lle S. Maarek, Ronald Fagin, Israel Z. Ben-Shaul, Dan Pelleg - 2000
268 An optimal graph theoretic approach to data clustering: Theory and its application to image segmentation – Zhenyu Wu, Richard Leahy - 1993