Human performance on clustering web pages: a preliminary study (1998)

by Sofus A. Macskassy , Arunava Banerjee , Brian D. Davison , Haym Hirsh
Venue:In Proceedings of The Fourth International Conference on Knowledge Discovery and Data Mining (KDD-98
Citations:26 - 1 self

Documents Related by Co-Citation

619 Scatter/Gather: A Cluster-based Approach to Browsing Large Document Collections – Douglass R. Cutting, David R. Karger, Jan O. Pedersen, John W. Tukey - 1992
2700 Indexing by latent semantic analysis – Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, Richard Harshman - 1990
3226 The Anatomy of a Large-Scale Hypertextual Web Search Engine – Sergey Brin, Lawrence Page - 1998
2693 Authoritative Sources in a Hyperlinked Environment – Jon M. Kleinberg - 1999
380 Reexamining the Cluster Hypothesis: Scatter/Gather on Retrieval Results – Marti A. Hearst, Jan O. Pedersen - 1996
1792 An algorithm for suffix stripping – M Porter - 1980
131 Clustering Algorithms – E Rasmussen - 1992
101 Fast and Intuitive Clustering of Web Documents – Oren Zamir, Oren Etzioni, Omid Madani, Richard M. Karp - 1997
1517 Term-weighting approaches in automatic text retrieval – Gerard Salton, Christopher Buckley - 1988
21 Ephemeral Document Clustering for Web Applications – YoĆ«lle S. Maarek, Ronald Fagin, Israel Z. Ben-Shaul, Dan Pelleg - 2000
61 An Information-Theoretic External Cluster-Validity Measure – Byron E. Dom, Byron E. Dom - 2001
234 Grouper: A Dynamic Clustering Interface to Web Search Results – Oren Zamir, Oren Etzioni - 1999
74 Partitioning-based clustering for web document categorization. Decision Support Systems – Daniel Boley, Maria Gini, Robert Gross, Eui-hong (sam Han, Kyle Hastings, George Karypis, Vipin Kumar, Bamshad Mobasher, Jerome Moore - 1999
528 Using Linear Algebra for Intelligent Information Retrieval – Susan T. Dumais, Michael Berry, Michael W. Berry, Susan, T. Dumais - 1995
272 TileBars: Visualization of Term Distribution Information in Full Text Information Access – Marti A. Hearst - 1995
168 A technique for measuring the relative size and overlap of public web search engines – K Bharat, A Broder - 1998
277 Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text – Soumen Chakrabarti , Byron Dom, Prabhakar Raghavan, Sridhar Rajagopalan, David Gibson, Jon Kleinberg - 1998
329 Web Document Clustering: A Feasibility Demonstration – Oren Zamir, Oren Etzioni - 1998
3108 Introduction to Modern Information Retrieval – G Salton - 1983