| Citations: | 7 - 2 self |
@MISC{Gelgi_abstractterm,
author = {Fatih Gelgi},
title = {ABSTRACT Term Ranking for Clustering Web Search Results},
year = {}
}
searches poses unique challenges. First, we show that one cannot readily import the frequency based feature ranking to cluster the web search results as in the text document clustering. Next, we present TermRank, a variation of the PageRank algorithm based on a relational graph representation of the content of web document collections. TermRank achieves desirable ranking of discriminative terms higher than the ambiguous terms, and ranking ambiguous terms higher than common terms. We experiment with two clustering algorithms to demonstrate the efficacy of TermRank. TermRank is shown to perform substantially better than frequency based classical methods.
Developed at and hosted by The College of Information Sciences and Technology
© 2007-2010 The Pennsylvania State University
