Human performance on clustering web pages: a preliminary study (1998)

by Sofus A. Macskassy , Arunava Banerjee , Brian D. Davison , Haym Hirsh
Venue:In Proceedings of The Fourth International Conference on Knowledge Discovery and Data Mining (KDD-98
Citations:26 - 1 self

Documents Related by Co-Citation

519 Scatter/Gather: A Cluster-based Approach to Browsing Large Document Collections – Douglass R. Cutting, David R. Karger, Jan O. Pedersen, John W. Tukey - 1992
2168 Indexing by latent semantic analysis – Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, Richard Harshman - 1990
2570 The Anatomy of a Large-Scale Hypertextual Web Search Engine – Sergey Brin, Lawrence Page - 1998
2222 Authoritative Sources in a Hyperlinked Environment – Jon M. Kleinberg - 1999
87 Fast and Intuitive Clustering of Web Documents – Oren Zamir, Oren Etzioni, Omid Madani, Richard M. Karp - 1997
331 Reexamining the Cluster Hypothesis: Scatter/Gather on Retrieval Results – Marti A. Hearst, Jan O. Pedersen - 1996
1459 An algorithm for suffix stripping – M Porter - 1980
279 Web Document Clustering: A Feasibility Demonstration – Oren Zamir, Oren Etzioni - 1998
6234 Maximum likelihood from incomplete data via the EM algorithm – A. P. Dempster, N. M. Laird, D. B. Rubin - 1977
306 A comparison of document clustering techniques – Michael Steinbach, George Karypis, Vipin Kumar - 2000
450 Using Linear Algebra for Intelligent Information Retrieval – Susan T. Dumais, Michael Berry, Michael W. Berry, Susan, T. Dumais - 1995
238 TileBars: Visualization of Term Distribution Information in Full Text Information Access – Marti A. Hearst - 1995
148 A technique for measuring the relative size and overlap of public web search engines – K Bharat, A Z Broder - 1998
251 Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text – Soumen Chakrabarti , Byron Dom, Prabhakar Raghavan, Sridhar Rajagopalan, David Gibson, Jon Kleinberg - 1998
119 Clustering algorithms – E Rasmussen - 1992
1216 Term-weighting approaches in automatic text retrieval – Gerard Salton, Christopher Buckley - 1988
2699 MJ: Introduction to Modern Information Retrieval – G Salton, McGill - 1986
16 Ephemeral Document Clustering for Web Applications – YoĆ«lle S. Maarek, Ronald Fagin, Israel Z. Ben-Shaul, Dan Pelleg - 2000
200 An optimal graph theoretic approach to data clustering: Theory and its application to image segmentation – Zhenyu Wu, Richard Leahy - 1993