• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Object/Relational Query Optimization with Chase and Backchase (2001)

by Lucian Popa
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 13
Next 10 →

On XML Integrity Constraints in the Presence of DTDs

by Wenfei Fan, Leonid Libkin
"... ..."
Abstract - Cited by 131 (20 self) - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

... specifications have been proposed for XML, e.g., the XML standard (DTD) [6], XML Data [27] and XML Schema [36]. Keys and foreign keys for XML are important in, among other things, query optimization =-=[34]-=-, data integration [21], and in data transformations between XML and database formats [28]. Research affiliation: Bell Laboratories. 1 XML data usually comes with a DTD that specifies how a document i...

Reformulation of xml queries and constraints

by Alin Deutsch, Val Tannen, Alin Deutsch, Val Tannen - In ICDT’03 , 2003
"... Abstract. We state and solve the query reformulation problem for XML publishing in a general setting that allows mixed (XML and relational) storage for the proprietary data and exploits redundancies (materialized views, indexes and caches) to enhance performance. The correspondence between published ..."
Abstract - Cited by 103 (15 self) - Add to MetaCart
Abstract. We state and solve the query reformulation problem for XML publishing in a general setting that allows mixed (XML and relational) storage for the proprietary data and exploits redundancies (materialized views, indexes and caches) to enhance performance. The correspondence between published and proprietary schemas is specified by views in both directions, and the same algorithm performs rewriting-with-views, composition-with-views, or the combined effect of both, unifying the Global-As-View and Local-As-View approaches to data integration. We prove a completeness theorem which guarantees that under certain conditions, our algorithm will find a minimal reformulation if one exists. Moreover, we identify conditions when this algorithm achieves optimal complexity bounds. We solve the reformulation problem for constraints by exploiting a reduction to the problem of query reformulation. 1
(Show Context)

Citation Context

...When our reformulation algorithm produces multiple candidates, these should be further compared using application-specific cost models. This important step is outside the scope of this paper (but see =-=[27]-=-).spings RDB→XML or XML→RDB just by writing them in XQuery. Thus, the schema correspondence is given by several XQuery views (in both directions). We also take integrity constraints into consideration...

Integrity Constraints for XML

by Wenfei Fan, Jérôme Siméon , 1999
"... this paper, we extend XML DTDs with several classes of integrity constraints and investigate the complexity of reasoning about these constraints. The constraints range over keys, foreign keys, inverse constraints as well as ID constraints for capturing the semantics of object identities. They imp ..."
Abstract - Cited by 97 (11 self) - Add to MetaCart
this paper, we extend XML DTDs with several classes of integrity constraints and investigate the complexity of reasoning about these constraints. The constraints range over keys, foreign keys, inverse constraints as well as ID constraints for capturing the semantics of object identities. They improve semantic specifications and provide a better reference mechanism for native XML applications. They are also useful in information exchange and data integration for preserving the semantics of data originating in relational and object-oriented databases. We establish complexity and axiomatization results for the (finite) implication problems associated with these constraints. In addition, we study implication of more general constraints, such as functional, inclusion and inverse constraints defined in terms of navigation paths

MARS: A System for Publishing XML from Mixed and Redundant Storage

by Alin Deutsch, Val Tannen - In VLDB , 2003
"... We present a system for publishing as XML data from mixed (relational+XML) proprietary storage, while supporting redundancy in storage for tuning purposes. The correspondence between public and proprietary schemas is given by a combination of LAVand GAV-style views expressed in XQuery. ..."
Abstract - Cited by 75 (12 self) - Add to MetaCart
We present a system for publishing as XML data from mixed (relational+XML) proprietary storage, while supporting redundancy in storage for tuning purposes. The correspondence between public and proprietary schemas is given by a combination of LAVand GAV-style views expressed in XQuery.
(Show Context)

Citation Context

...d on incremental engineering of the original one, but rather on an entirely new paradigm. The main operation used in the C&B algorithm is the chase. The new implementation exploits the observation in =-=[25]-=- that chasing a query Q with a constraint c can be viewed as evaluating a relational query obtained from c over a small database obtained from Q. This enables us to apply classical query processing te...

On Verifying Consistency of XML Specifications

by Marcelo Arenas, Wenfei Fan, Leonid Libkin
"... XML specifications often consist of a type definition (typically, a DTD) and a set of integrity constraints. It has been shown previously that such specifications can be inconsistent, and thus it is often desirable to check consistency at compile-time. It is known that for general keys and foreign k ..."
Abstract - Cited by 49 (8 self) - Add to MetaCart
XML specifications often consist of a type definition (typically, a DTD) and a set of integrity constraints. It has been shown previously that such specifications can be inconsistent, and thus it is often desirable to check consistency at compile-time. It is known that for general keys and foreign keys, and DTDs, the consistency problem is undecidable; however, it becomes NP-complete when all keys are one-attribute (unary), and tractable, if no foreign keys are used.

Query reformulation with constraints

by Alin Deutsch, Lucian Popa, Val Tannen - SIGMOD Record
"... Let Σ1, Σ2 be two schemas, which may overlap, C be a set of constraints on the joint schema Σ1 ∪ Σ2, and q1 be a Σ1-query. An (equivalent) reformulation of q1 in the presence of C is a Σ2-query, q2, such that q2 gives the same answers as q1 on ..."
Abstract - Cited by 38 (4 self) - Add to MetaCart
Let Σ1, Σ2 be two schemas, which may overlap, C be a set of constraints on the joint schema Σ1 ∪ Σ2, and q1 be a Σ1-query. An (equivalent) reformulation of q1 in the presence of C is a Σ2-query, q2, such that q2 gives the same answers as q1 on
(Show Context)

Citation Context

...more distantly [6, 23], covered complex values and OO classes with extents. A comprehensive approach to query optimization for this model, including join (usual and dependent) reordering, appeared in =-=[28]-=-, see also[29]. Query reformulation is also essential for data publishing [32, 12] where Σ1 is the public schema and Σ2 the proprietary (storage) schema. It is equally essential in schema evolution wh...

Finding equivalent rewritings in the presence of arithmetic comparisons

by Foto Afrati, Manolis Gergatsoulis - In EDBT , 2006
"... Abstract. The problem of rewriting queries using views has received significant attention because of its applications in a wide variety of datamanagement problems. For select-project-join SQL (a.k.a. conjunctive) queries and views, there are efficient algorithms in the literature, which find equival ..."
Abstract - Cited by 4 (2 self) - Add to MetaCart
Abstract. The problem of rewriting queries using views has received significant attention because of its applications in a wide variety of datamanagement problems. For select-project-join SQL (a.k.a. conjunctive) queries and views, there are efficient algorithms in the literature, which find equivalent and maximally contained rewritings. In the presence of arithmetic comparisons (ACs) the problem becomes more complex. We do not know how to find maximally contained rewritings in the general case. There are algorithms which find maximally contained rewritings only for special cases such as when ACs are restricted to be semi-interval. However, we know that the problem of finding an equivalent rewriting (if there exists one) in the presence of ACs is decidable, yet still doubly exponential. This complexity calls for an efficient algorithm which will perform better on average than the complete enumeration algorithm. In this work we present such an algorithm which is sound and complete. Its efficiency lies in that it considers fewer candidate rewritings because it includes a preliminary test to decide for each view whether it is potentially useful in some rewriting. 1
(Show Context)

Citation Context

... useful view subgoals. These useful view subgoals are found by using techniques for finding rewritings of queries and views without arithmetic comparisons. In more detail, we use chaselike techniques =-=[7, 19, 5]-=- to find candidate useful subgoals and then we prune the space even further by using techniques used in finding maximally contained rewritings [20]. The main challenge of our algorithm however comes f...

Using views to generate efficient evaluation plans for queries

by Foto N. Afrati , Chen Li, Jeffrey D. Ullman , 2007
"... We study the problem of generating efficient, equivalent rewritings using views to compute the answer to a query. We take the closed-world assumption, in which views are materialized from base relations, rather than views describing sources in terms of abstract predicates, as is common when the open ..."
Abstract - Cited by 4 (3 self) - Add to MetaCart
We study the problem of generating efficient, equivalent rewritings using views to compute the answer to a query. We take the closed-world assumption, in which views are materialized from base relations, rather than views describing sources in terms of abstract predicates, as is common when the open-world assumption is used. In the closed-world model, there can be an infinite number of different rewritings that compute the same answer, yet have quite different performance. Query optimizers take a logical plan (a rewriting of the query) as an input, and generate efficient physical plans to compute the answer. Thus our goal is to generate a small subset of the possible logical plans without missing an optimal physical plan. We first consider a cost model that counts the number of subgoals in a physical plan, and show a search space that is guaranteed to include an optimal rewriting, if the query has a rewriting in terms of the views. We also develop an efficient algorithm for finding rewritings with the minimum number of subgoals. We then consider a cost model that counts the sizes of intermediate relations of a physical plan, without dropping any attributes, and give a search space for finding optimal rewritings. Our final cost model allows attributes to be dropped in intermediate relations. We show that, by careful variable renaming, it is possible to do better than the standard “supplementary relation ” approach, by dropping attributes that the latter approach would retain. Experiments show that our algorithm of generating optimal rewritings has good efficiency and scalability.

Finite Implication of Keys and Foreign Keys for XML Data

by Wenfei Fan, Leonid Libkin , 2000
"... We investigate finite implication problems associated with key and foreign key constraints for XML data. We demonstrate that there is interaction between DTDs and these constraints, and the interaction complicates the analysis of finite implication. In particular, we establish complexity results for ..."
Abstract - Cited by 3 (3 self) - Add to MetaCart
We investigate finite implication problems associated with key and foreign key constraints for XML data. We demonstrate that there is interaction between DTDs and these constraints, and the interaction complicates the analysis of finite implication. In particular, we establish complexity results for reasoning about two classes of constraints for XML. One class, L u , defines unary keys and foreign keys, and the other, L, consists of multi-attribute keys and foreign keys. We show that the finite implication problem for L u constraints is coNP-complete, and the finite implication problem for L constraints is undecidable. We also identify several PTIME decidable cases of the finite implication problems. In addition, we improve the results established in [16] by showing that the finite satisfiability problem for unary keys, unary inclusion constraints and their negations is NP-complete.
(Show Context)

Citation Context

... XML Schema [29], XML Data [21] and in a recent proposal [6]. They express a fundamental part of the semantics of XML data and are important in, among other things, query formulation and optimization =-=[27]-=-, data transformation and integration [18], and in data exchange for converting databases to an XML encoding. To take advantage of keys and foreign keys, one needs to reason about them. There are two ...

Using Views to Generate Efficient Evaluation Plans for Queries ∗

by Foto N. Afrati A, Chenli B, Jeffrey D. Ullman C
"... We study the problem of generating efficient, equivalent rewritings using views to compute the answer to a query. We take the closed-world assumption, in which views are predicates, as is common when the open-world assumption is used. In the closed-world model, there can be an infinite number of dif ..."
Abstract - Add to MetaCart
We study the problem of generating efficient, equivalent rewritings using views to compute the answer to a query. We take the closed-world assumption, in which views are predicates, as is common when the open-world assumption is used. In the closed-world model, there can be an infinite number of different rewritings that compute the same answer, yet have quite different performance. Query optimizers take a logical plan (a rewriting of the query) as an input, and generate efficient physical plans to compute the answer. Thus our goal is to generate a small subset of the possible logical plans without missing an optimal physical plan. We first consider a cost model that counts the number of subgoals in a physical plan, and show a search space that is guaranteed to include an optimal rewriting, if the query has a rewriting in terms of the views. We also develop an efficient algorithm for finding rewritings with the minimum number of subgoals. We then consider a cost model that counts the sizes of intermediate relations of a physical plan, without dropping any attributes, and give a search space for finding optimal rewritings. Our final cost model allows attributes to be dropped in intermediate relations. We show that, by careful variable renaming, it is possible to do better than the standard “supplementary relation ” approach, by dropping attributes that the latter approach would retain. Experiments show that our algorithm of generating optimal rewritings has good efficiency and scalability. 1.
(Show Context)

Citation Context

...t queries based on certain constraints. In our setting, these constraints are the views. Chase has been used in query optimization for deciding equivalence of queries (see, e.g., Lucian Popa’s thesis =-=[21]-=-). We have shown that the search space for globally minimal rewritings is finite by stating in Theorem 3.1 that it is sufficient to search in the space of all LMRs of a query that use only view tuples...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University