Combining text and link analysis for focused crawling – an application for vertical search engines. Information Systems (2007)
Citations: 8 (0 self)
BibTeX
@ARTICLE{Almpanidis07combiningtext,
author = {George Almpanidis and Constantine Kotropoulos},
title = {Combining text and link analysis for focused crawling–an application for vertical search engines},
journal = {Information Systems},
year = {2007}
}
Abstract
The number of vertical search engines and portals has increased rapidly in recent years, making the importance of a topic-driven (focused) crawler evident. In this paper, we develop a latent semantic indexing classifier that combines link analysis with text content in order to retrieve and index domain-specific web documents. We compare its efficiency with that of other well-known web information retrieval techniques. Our implementation presents a different approach to focused crawling and aims to overcome the need to provide initial training data while maintaining a high recall/precision ratio.







