Topical Locality in the Web (2000)
| Venue: | In Proceedings of the 23rd Annual International Conference on Research and Development in Information Retrieval (SIGIR 2000 |
| Citations: | 108 - 8 self |
BibTeX
@INPROCEEDINGS{Davison00topicallocality,
author = {Brian D. Davison},
title = {Topical Locality in the Web},
booktitle = {In Proceedings of the 23rd Annual International Conference on Research and Development in Information Retrieval (SIGIR 2000},
year = {2000},
pages = {272--279},
publisher = {ACM Press}
}
Years of Citing Articles
OpenURL
Abstract
Most web pages are linked to others with related content. This idea, combined with another that says that text in, and possibly around, HTML anchors describe the pages to which they point, is the foundation for a usable WorldWide Web. In this paper, we examine to what extent these ideas hold by empirically testing whether topical locality mirrors spatial locality of pages on the Web. In particular, we find that the likelihood of linked pages having similar textual content to be high; the similarity of sibling pages increases when the links from the parent are close together; titles, descriptions, and anchor text represent at least part of the target page; and that anchor text may be a useful discriminator among unseen child pages. These results show the foundations necessary for the success of many web systems, including search engines, focused crawlers, linkage analyzers, and intelligent web agents.







