Data Integration Using Similarity Joins and a Word-Based Information Representation Language (2000)

by William W. Cohen
Venue: ACM TRANSACTIONS ON INFORMATION SYSTEMS
Citations: 83 - 9 self

Documents Related by Co-Citation

300 The Merge/Purge Problem for Large Databases – Mauricio A. Hernández, Salvatore Stolfo - 1995
91 Learning Object Identification Rules for Information Integration – Sheila Tejada, Craig A. Knoblock, Steven Minton - 2001
395 A Theory for Record Linkage – I P Fellegi, A B Sunter - 1969
128 Learning to Match and Cluster Large High-Dimensional Data Sets For Data Integration – William W. Cohen, Jacob Richman - 2002
151 The field matching problem: Algorithms and applications – Alvaro Monge, Charles Elkan - 1996
256 Efficient Clustering of High-Dimensional Data Sets with Application to Reference Matching – Andrew McCallum, Kamal Nigam, Lyle H. Ungar - 2000
131 Automatic linkage of vital records – H B Newcombe, J M Kennedy, S J Axford, A P James - 1959
196 Interactive Deduplication using Active Learning – Sunita Sarawagi, Anuradha Bhamidipaty - 2002
174 An Efficient Domain-Independent Algorithm for Detecting Approximately Duplicate Database Records – Alvaro Monge, Charles Elkan - 1997
217 The State of Record Linkage and Current Research Problems – William E. Winkler - 1999
237 Adaptive Duplicate Detection Using Learnable String Similarity Measures – Mikhail Bilenko, Raymond J. Mooney - 2003
193 Learning String Edit Distance – Eric Sven Ristad, Peter N. Yianilos - 1997
214 Integration of Heterogeneous Databases Without Common Domains Using Queries Based on Textual Similarity – William W. Cohen - 1998
325 Learning to Order Things – William W. Cohen, Robert E. Schapire, Yoram Singer - 1998
32 A theory for record linkage – I P Fellegi, A B Sunter - 1969
332 A comparison of string distance metrics for name-matching tasks – William W. Cohen, Pradeep Ravikumar, Stephen E. Fienberg - 2003
142 Advances in record-linkage methodology as applied to matching the 1985 census of Tampa, Florida – M Jaro - 1989
149 Potter's Wheel: An Interactive Data Cleaning System – Vijayshankar Raman, Joseph M. Hellerstein - 2001
153 Identity Uncertainty and Citation Matching – Hanna Pasula, Bhaskara Marthi, Brian Milch, Stuart Russell, Ilya Shpitser - 2003