• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 584
Next 10 →

Collective entity resolution in relational data

by Indrajit Bhattacharya, Lise Getoor - ACM Transactions on Knowledge Discovery from Data (TKDD , 2006
"... Many databases contain uncertain and imprecise references to real-world entities. The absence of identifiers for the underlying entities often results in a database which contains multiple references to the same entity. This can lead not only to data redundancy, but also inaccuracies in query proces ..."
Abstract - Cited by 146 (12 self) - Add to MetaCart
of references. However, there is often additional relational information in the data. Specifically, references to different entities may cooccur. In these cases, collective entity resolution, in which entities for cooccurring references are determined jointly rather than independently, can improve entity

A Machine Learning Approach to Coreference Resolution of Noun Phrases

by Wee Meng Soon, Daniel Chung, Daniel Chung Yong Lim, Yong Lim, Hwee Tou Ng , 2001
"... this paper, we present a learning approach to coreference resolution of noun phrases in unrestricted text. The approach learns from a small, annotated corpus and the task includes resolving not just a certain type of noun phrase (e.g., pronouns) but rather general noun phrases. It also does not rest ..."
Abstract - Cited by 270 (3 self) - Add to MetaCart
not restrict the entity types of the noun phrases; that is, coreference is assigned whether they are of "organization," "person," or other types. We evaluate our approach on common data sets (namely, the MUC-6 and MUC-7 coreference corpora) and obtain encouraging results, indicating

Entity Resolution with Markov Logic

by Parag Singla, Pedro Domingos - In ICDM , 2006
"... Entity resolution is the problem of determining which records in a database refer to the same entities, and is a crucial and expensive step in the data mining process. Interest in it has grown rapidly in recent years, and many approaches have been proposed. However, they tend to address only isolate ..."
Abstract - Cited by 105 (10 self) - Add to MetaCart
Entity resolution is the problem of determining which records in a database refer to the same entities, and is a crucial and expensive step in the data mining process. Interest in it has grown rapidly in recent years, and many approaches have been proposed. However, they tend to address only

Crowder: Crowdsourcing entity resolution

by Jiannan Wang, Tim Kraska, Michael J. Franklin, Jianhua Feng - PVLDB , 2012
"... Entity resolution is central to data integration and data cleaning. Algorithmic approaches have been improving in quality, but remain far from perfect. Crowdsourcing platforms offer a more accurate but expensive (and slow) way to bring human insight into the process. Previous work has proposed batch ..."
Abstract - Cited by 73 (6 self) - Add to MetaCart
Entity resolution is central to data integration and data cleaning. Algorithmic approaches have been improving in quality, but remain far from perfect. Crowdsourcing platforms offer a more accurate but expensive (and slow) way to bring human insight into the process. Previous work has proposed

A Latent Dirichlet Model for Unsupervised Entity Resolution

by Indrajit Bhattacharya, Lise Getoor - SIAM INTERNATIONAL CONFERENCE ON DATA MINING , 2006
"... Entity resolution has received considerable attention in recent years. Given many references to underlying entities, the goal is to predict which references correspond to the same entity. We show how to extend the Latent Dirichlet Allocation model for this task and propose a probabilistic model for ..."
Abstract - Cited by 102 (6 self) - Add to MetaCart
variable. We propose a novel sampling algorithm for collective entity resolution which is unsupervised and also takes entity relations into account. Additionally, we do not assume the domain of entities to be known and show how to infer the number of entities from the data. We demonstrate the utility

Entity Resolution for Uncertain Data

by unknown authors
"... Entity resolution (ER), also known as duplicate detection or record matching, is the prob-lem of identifying the tuples that represent the same real world entity. In this paper, we address the problem of ER for uncertain data, which we call ERUD. We propose two dif-ferent approaches for the ERUD pro ..."
Abstract - Add to MetaCart
Entity resolution (ER), also known as duplicate detection or record matching, is the prob-lem of identifying the tuples that represent the same real world entity. In this paper, we address the problem of ER for uncertain data, which we call ERUD. We propose two dif-ferent approaches for the ERUD

Generic Entity Resolution with Data Confidences

by David Menestrina, Omar Benjelloun, Hector Garcia-molina - In VLDB CleanDB Workshop, Seoul, Korea , 2006
"... We consider the Entity Resolution (ER) problem (also known as deduplication, or merge-purge), in which records determined to represent the same realworld entity are successively located and merged. Our approach to the ER problem is generic, in the sense that the functions for comparing and merging r ..."
Abstract - Cited by 22 (4 self) - Add to MetaCart
We consider the Entity Resolution (ER) problem (also known as deduplication, or merge-purge), in which records determined to represent the same realworld entity are successively located and merged. Our approach to the ER problem is generic, in the sense that the functions for comparing and merging

Query-time entity resolution

by Indrajit Bhattacharya, Lise Getoor - In The ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD , 2006
"... Entity resolution is the problem of reconciling database references corresponding to the same real-world entities. Given the abundance of publicly available databases that have unresolved entities, we motivate the problem of query-time entity resolution: quick and accurate resolution for answering q ..."
Abstract - Cited by 22 (2 self) - Add to MetaCart
of the performance trends predicted by our analysis of collective entity resolution over a wide range of structural characteristics in the data. 1.

Entity Resolution with Evolving Rules

by Steven Euijong Whang, Hector Garcia-molina
"... Entity resolution (ER) identifies database records that refer to the same real world entity. In practice, ER is not a one-time process, but is constantly improved as the data, schema and application are better understood. We address the problem of keeping the ER result up-to-date when the ER logic “ ..."
Abstract - Cited by 11 (2 self) - Add to MetaCart
Entity resolution (ER) identifies database records that refer to the same real world entity. In practice, ER is not a one-time process, but is constantly improved as the data, schema and application are better understood. We address the problem of keeping the ER result up-to-date when the ER logic

Adaptive graphical approach to entity resolution

by Zhaoqi Chen, Dmitri V. Kalashnikov, Sharad Mehrotra - In: ACM IEEE Joint Conference on Digital Libraries 2007 (ACM IEEE JCDL 2007 , 2007
"... Entity resolution is a very common Information Quality (IQ) problem with many different applications. In digital libraries, it is related to problems of citation matching and author name disambiguation; in Natural Language Processing, it is related to coreference matching and object identity; in Web ..."
Abstract - Cited by 29 (14 self) - Add to MetaCart
that measures the degree of interconnectedness between various pairs of nodes in the graph can significantly improve the quality of entity resolution. Furthermore, the paper presents an algorithm for making that technique self-adaptive to the underlying data, thus minimizing the required participation from
Next 10 →
Results 1 - 10 of 584
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University