Distributed Information Retrieval (2000)
| Venue: | In: Advances in Information Retrieval |
| Citations: | 116 - 18 self |
BibTeX
@INPROCEEDINGS{Callan00distributedinformation,
author = {Jamie Callan},
title = {Distributed Information Retrieval},
booktitle = {In: Advances in Information Retrieval},
year = {2000},
pages = {127--150},
publisher = {Kluwer Academic Publishers}
}
Years of Citing Articles
OpenURL
Abstract
A multi-database model of distributed information retrieval is presented, in which people are assumed to have access to many searchable text databases. In such an environment, full-text information retrieval consists of discovering database contents, ranking databases by their expected ability to satisfy the query, searching a small number of databases, and merging results returned by different databases. This paper presents algorithms for each task. It also discusses how to reorganize conventional test collections into multi-database testbeds, and evaluation methodologies for multi-database experiments. A broad and diverse group of experimental results is presented to demonstrate that the algorithms are effective, efficient, robust, and scalable. 1. INTRODUCTION Wide area networks, particularly the Internet, have transformed how people interact with information. Much of the routine information access by the general public is now based on full-text information retrieval, as opposed t...







