Abstract:
This paper takes a first step towards the design and normalization theory for XML documents. We show that, like relational databases, XML documents may contain redundant information, and may be prone to update anomalies. Furthermore, such problems are caused by certain functional dependencies among paths in the document. Our goal is to find a way of converting an arbitrary DTD into a well-designed one, that avoids these problems. We first introduce the concept of a functional dependency for XML, and define its semantics via a relational representation of XML. We then define an XML normal form, XNF, that avoids update anomalies and redundancies. We study its properties and show that it generalizes BCNF and a normal form for nested relations when those are appropriately coded as XML documents. Finally, we present a lossless algorithm for converting any DTD into one in XNF.
Citations
|
1028
|
Foundations of Databases
– Abiteboul, Hull, et al.
- 1995
|
|
328
|
Relational Databases for Querying XML Documents: Limitations and Opportunities
– Shanmugasundaram, Tufte, et al.
- 1999
|
|
239
|
Linear-Time Algorithms for Testing the Satisfiability of Propositional Horn Formulae
– Dowling, Gallier
- 1984
|
|
234
|
Semantics of Programming Languages: Structures and Techniques. Foundations of Computing
– Gunter
- 1992
|
|
190
|
Incomplete information in relational databases
– Imielinski, Lipski
- 1984
|
|
136
|
Updating XML
– Tatarinov, Ives, et al.
- 2001
|
|
107
|
Storing and querying XML data using an RDMBS
– Florescu, Kossman
- 1999
|
|
100
|
Elements of Relational Database Theory
– Kanellakis
- 1990
|
|
84
|
Relative Information Capacity of Simple Relational Database Schemata
– Hull
- 1986
|
|
82
|
Efficient storage of XML data
– Kanne, Moerkotte
- 2000
|
|
77
|
On XML integrity constraints in the presence of dtds
– Fan, Libkin
|
|
69
|
Integrity constraints for XML
– Fan, Simeon
- 2003
|
|
62
|
Keys for xml
– Buneman, Davidson, et al.
- 2001
|
|
57
|
Using powerdomains to generalize relational databases
– Buneman, Ohori
- 1989
|
|
48
|
W.: Reasoning about keys for XML
– Buneman, Davidson, et al.
- 2003
|
|
47
|
A new normal form for nested relations
– ÖZSOYOGLU, L-Y
- 1987
|
|
38
|
W.Y.: Developing XML documents with guaranteed ”good” properties
– Embley, Mok
- 2001
|
|
37
|
The Problem of Incomplete Information in Relational Databases
– Grahne
- 1991
|
|
32
|
Representing and querying xml with incomplete information
– Abiteboul, Segoufin, et al.
|
|
32
|
Bounded fixpoints for complex objects
– Suciu
- 1993
|
|
30
|
L.: An information-theoretic approach to normal forms for relational and XML data
– Arenas, Libkin
- 2005
|
|
27
|
Correction to ”an equivalence between relational database dependencies and a fragment of propositional logic
– Sagiv, Delobel, et al.
- 1987
|
|
21
|
W.L.: Designing functional dependencies for XML
– Lee, Ling, et al.
- 2002
|
|
20
|
A normal form for precisely characterizing redundancy in nested relations
– MOK, NG, et al.
- 1996
|
|
16
|
Object normal forms and dependency constraints for object-oriented schemata’, Trans. Database Syst
– Tari, Stokes, et al.
- 1997
|
|
11
|
Normal Form Algorithms for Extended Context-Free Grammars
– Albert, Giammarresi, et al.
- 2001
|
|
10
|
Axiomatisation of functional dependencies in incomplete relations
– Levene, Loizou
- 1998
|
|
7
|
Functional dependencies in relations with null values
– Atzeni, Morfuni
- 1984
|
|
2
|
Simulation of the nested relational algebra by the flat relational algebra, with an application to the complexity of evaluating powerset algebra expressions. Theoret
– Arenas, BUSSCHE, et al.
- 2001
|
|
1
|
A Normal Form for XML Documents • 231
– BEERI, BERNSTEIN, et al.
- 1978
|