by AnHai Doan Co-Chairs of Supervisory Committee: Professor Alon Y. Halevy Computer Science & Engineering Professor Pedro M. Domingos Computer Science & Engineering This dissertation studies representation matching: the problem of creating semantic mappings between two data representations. Examples of data representations are relational schemas, ontologies, and XML DTDs. Examples of semantic mappings include "element location of one representation maps to element address of the other", "contact-phone maps to agent-phone", and "listed-price maps to price * (1 + tax-rate)".
|
4923
|
Elements of Information Theory
– Cover, Thomas
- 1991
|
|
3011
|
Pattern Classification and Scene Analysis
– Duda, Hart
- 1973
|
|
871
|
Federated database systems for managing distributed, heterogeneous, and autonomous databases
– Sheth, Larson
- 1990
|
|
595
|
Querying Heterogeneous Information Sources Using Source Descriptions
– Levy, Rajaraman, et al.
- 1996
|
|
514
|
A comparison of event models for naive bayes text classification
– McCallum, Nigam
- 1998
|
|
500
|
Categorical Data Analysis
– Agresti
- 1990
|
|
489
|
A Formal Basis for the Heuristic Determination of Minimum Cost Paths
– Hart, Nilsson, et al.
- 1968
|
|
379
|
Stacked generalization
– Wolpert
- 1992
|
|
363
|
On the optimality of the simple Bayesian classifier under zero-one loss
– Domingos, Pazzani
- 1997
|
|
354
|
Ontologies: Silver Bullet for Knowledge Management and Electronic
– Fensel
|
|
325
|
An information-theoretic definition of similarity
– Lin
- 1998
|
|
319
|
Generic schema matching with Cupid
– Madhavan, Bernstein, et al.
- 2001
|
|
280
|
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment
– Noy, Musen
- 2000
|
|
255
|
Reconciling schemas of disparate data sources: A machine-learning approach
– Doan, Domingos, et al.
- 2001
|
|
254
|
Enhanced hypertext categorization using hyperlinks
– Chakrabarti, Dom, et al.
- 1998
|
|
232
|
Similarity Flooding: A Versatile Graph Matching Algorithm
– Melnik, Molina-Garcia, et al.
- 2002
|
|
221
|
2002 Ontology Learning for the Semantic Web
– Maedche
|
|
194
|
COMA: A system for flexible combination of schema matching approaches
– Do, Rahm
- 2002
|
|
189
|
On the foundations of relaxation labeling processes
– HUMMEL, ZUCICER
- 1983
|
|
174
|
Quilt: An XML Query Language for Heterogeneous Data Sources
– Chamberlin, Robie, et al.
- 2000
|
|
174
|
Using schema matching to simplify heterogeneous data translation
– Milo, Zohar
- 1998
|
|
165
|
Wrapper Generation for Semi-structured Internet Sources
– Ashish, Knoblock
- 1997
|
|
162
|
Wrapper induction: Efficiency and expressiveness
– Kushmerick
- 2000
|
|
144
|
An Adaptive Query Execution System for Data Integration
– Ives, Florescu, et al.
- 1999
|
|
138
|
Schema mapping as query discovery
– Miller, Haas, et al.
- 2000
|
|
120
|
Applying Model Management to Classical Meta Data Problems
– Bernstein
- 2003
|
|
107
|
Modeling web sources for information integration
– Knoblock, Minton, et al.
- 1998
|
|
87
|
OntoMorph: A Translation System for Symbolic Knowledge
– Chalupsky
|
|
87
|
Data cleaning: Problems and current approaches
– Rahm, Do
|
|
84
|
SemInt - A Tool for Identifying Attribute Correspondences in Heterogeneous Databases Using Neural Network
– Li, Clifton
|
|
83
|
Semantic integration of heterogeneous information sources
– Bergamaschi, Castano, et al.
|
|
83
|
R.: A vision of management of complex models
– Bernstein, Halevy, et al.
- 2000
|
|
82
|
Anchor-PROMPT: using non-local context for semantic matching
– Noy, Musen
|
|
80
|
Representing and reasoning about mappings between domain models
– Madhavan, Bernstein, et al.
- 2002
|
|
79
|
Machine learning for information extraction in informal domains
– Freitag
- 1998
|
|
75
|
The chimaera ontology environment
– McGuinness, Fikes, et al.
- 2000
|
|
74
|
Category translation: learning to understand information on the internet
– Perkowitz, Etzioni
- 1995
|
|
71
|
Comparison of schema matching evaluations
– Do, Melnik, et al.
- 2002
|
|
69
|
D.: The use of classifiers in sequential inference
– Punyakanok, Roth
|
|
68
|
A portrait of the semantic web in action
– Heflin, Hendler
|
|
68
|
Data-driven understanding and refinement of schema mappings
– Yan, Miller, et al.
- 2001
|
|
61
|
Semi-automatic integration of knowledge sources. Proc. of the 2nd Int. Conf. On Information FUSION'99
– Mitra, Wiederhold
- 1999
|
|
60
|
On matching schemas automatically
– Rahm, Bernstein
- 2001
|
|
54
|
Semantic Integration in Heterogeneous Databases Using Neural Networks
– Li, Clifton
- 2001
|
|
50
|
Issues in stacked generalization
– Ting, Witten
- 1999
|
|
48
|
Optimizing recursive information gathering plans
– Lambrecht, Kambhampati, et al.
- 1999
|
|
43
|
Joins that generalize: text classification using WHIRL
– Cohen, Hirsch
- 1998
|
|
42
|
A formal view integration method
– Biskup, Convent
- 1986
|
|
41
|
Promptdiff: A fixed-point algorithm for comparing ontology versions
– Noy, Musen
- 2002
|
|
38
|
Issues and approaches of database integration
– Parent, S
- 1998
|