Record Linkage: Current Practice and Future Directions (2003)
| Venue: | CSIRO Mathematical and Information Sciences |
| Citations: | 27 - 0 self |
BibTeX
@TECHREPORT{Gu03recordlinkage:,
author = {Lifang Gu and Rohan Baxter and Deanne Vickers and Chris Rainsford},
title = {Record Linkage: Current Practice and Future Directions},
institution = {CSIRO Mathematical and Information Sciences},
year = {2003}
}
Abstract
Record linkage is the task of quickly and accurately identifying records corresponding to the same entity from one or more data sources. Record linkage is also known as data cleaning, entity reconciliation or identification, and the merge/purge problem. This paper presents the "standard" probabilistic record linkage model and the associated algorithm. Recent work in information retrieval, federated database systems and data mining has proposed alternatives to key components of the standard algorithm. The impact of these alternatives on the standard approach is assessed. The key question is whether and how these new alternatives are better in terms of time, accuracy and degree of automation for a particular record linkage application.







