• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

IQ: The Case for Iterative Querying for Knowledge

by Yosi Mass
Add To MetaCart

Tools

Sorted by:
Results 1 - 5 of 5

Finding a minimal tree pattern under neighborhood constraints

by Benny Kimelfeld, Yehoshua Sagiv - In PODS , 2011
"... Tools that automatically generate queries are useful when schemas are hard to understand due to size or complexity. Usually, these tools find minimal tree patterns that contain a given set (or bag) of labels. The labels could be, for example, XML tags or relation names. The only restriction is that, ..."
Abstract - Cited by 2 (1 self) - Add to MetaCart
Tools that automatically generate queries are useful when schemas are hard to understand due to size or complexity. Usually, these tools find minimal tree patterns that contain a given set (or bag) of labels. The labels could be, for example, XML tags or relation names. The only restriction is that, in a tree pattern, adjacent labels must be among some specified pairs. A more expressive framework is developed here, where a schema is a mapping of each label to a collection of bags of labels. A tree pattern conforms to the schema if for all nodes v, the bag comprising the labels of the neighbors is contained in one of the bags to which the label of v is mapped. The problem at hand is to find a minimal tree pattern that conforms to the schema and contains a given bag of labels. This problem is NP-hard even when using the simplest conceivable language for describing schemas. In practice, however, the set of labels is small, so efficiency is realized by means of an algorithm that is fixed-parameter tractable (FPT). Two languages for specifying schemas are discussed. In the first, one expresses pairwise mutual exclusions between labels. Though W[1]-hardness (hence, unlikeliness of an FPT algorithm) is shown, an FPT algorithm is described for the case where the mutual exclusions form a circular-arc graph (e.g., disjoint cliques). The second language is that of regular expres-sions, and for that another FPT algorithm is described.
(Show Context)

Citation Context

...ty to use existing systems (e.g., RDBMS) that already have highly optimized query engines. Thus, the pattern approach can be implemented rather easily over such systems. Furthermore, as enunciated in =-=[19]-=-, keyword search over data graphs is carried out across two dimensions. In one dimension, the user explores the different semantic connections (where each one is represented by a tree pattern). In the...

Entity-Centric Search For Enterprise Services

by Marcus Roy, Ingo Weber, Boualem Benatallah
"... Abstract. The consumption of APIs, such as Enterprise Services (ESs) in an enterprise Service-Oriented Architecture (eSOA), has largely been a task for experienced developers. With the rapidly growing number of such (Web)APIs, users with little or no experience in a given API face the problem of fin ..."
Abstract - Add to MetaCart
Abstract. The consumption of APIs, such as Enterprise Services (ESs) in an enterprise Service-Oriented Architecture (eSOA), has largely been a task for experienced developers. With the rapidly growing number of such (Web)APIs, users with little or no experience in a given API face the problem of finding relevant API operations – e.g., mashups developers. However, building an effective search has been a challenge: Information Retrieval (IR) methods struggle with the brevity of text in API descriptions, whereas semantic search technologies require domain ontologies and formal queries. Motivated by the search behavior of users, we propose an iterative keyword search based on entities. The entities are part of a knowledge base, whose content stems from model-driven engineering. We implemented our approach and conducted a user study showing significant improvements in search effectiveness. 1
(Show Context)

Citation Context

.... In a similar context, we consider the relationship defined in automata to rank types of entities and use entity frequency to identify key ESs. Second, our iterative querying paradigm is inspired by =-=[13]-=-. ActiveObjects [11] advocates the learning of actions on entities extracted from Web search logs – intriguing for Web queries, less applicable for service design with a well defined action vocabulary...

Lenses: An On-Demand Approach to ETL

by Ying Yang, Niccolo ̀ Meneghetti, Ronny Fehling, Zhen Hua Liu, Oliver Kennedy
"... Three mentalities have emerged in analytics. One view holds that reliable analytics is impossible without high-quality data, and relies on heavy-duty ETL processes and upfront data curation to provide it. The second view takes a more ad-hoc approach, collecting data into a data lake, and plac-ing re ..."
Abstract - Add to MetaCart
Three mentalities have emerged in analytics. One view holds that reliable analytics is impossible without high-quality data, and relies on heavy-duty ETL processes and upfront data curation to provide it. The second view takes a more ad-hoc approach, collecting data into a data lake, and plac-ing responsibility for data quality on the analyst querying it. A third, on-demand approach has emerged over the past decade in the form of numerous systems like Paygo or HLog, which allow for incremental curation of the data and help analysts to make principled trade-offs between data quality and effort. Though quite useful in isolation, these systems target only specific quality problems (e.g., Paygo targets only schema matching and entity resolution). In this paper, we explore the design of a general, extensible infrastructure for on-demand curation that is based on probabilistic query processing. We illustrate its generality through examples and show how such an infrastructure can be used to grace-fully make existing ETL workflows “on-demand”. Finally, we present a user interface for On-Demand ETL and address ensuing challenges, including that of efficiently ranking po-tential data curation tasks. Our experimental results show that On-Demand ETL is feasible and that our greedy rank-ing strategy for curation tasks, called CPI, is effective. 1.
(Show Context)

Citation Context

... generalizes these, providing a tool for ondemand data curation that these solutions can be plugged into. On-demand curation can also be thought of as a highlytargeted form of crowd-sourced databases =-=[32]-=-, which leverage the power of humans to perform complex tasks. The problem of incomplete data arises frequently in distributed systems, where node failures are common. Existing solutions based on unce...

Wearable Queries: Adapting Common Retrieval Needs to Data and Users

by Barbara Catania, Giovanna Guerrini, Alberto Belussi, Federica Mandreoli, Riccardo Martoglia, Wilma Penzo
"... The wealth of information generated by users interacting with the network and its applications is often under-utilized due to complications in accessing heterogeneous and dy-namic data and retrieving relevant information from sources having possibly unknown formats and structures. Process-ing comple ..."
Abstract - Add to MetaCart
The wealth of information generated by users interacting with the network and its applications is often under-utilized due to complications in accessing heterogeneous and dy-namic data and retrieving relevant information from sources having possibly unknown formats and structures. Process-ing complex requests on such information sources can, thus, be costly, though not guaranteeing user satisfaction. Fur-thermore, dynamic contexts prevent substantial user involve-ment in the interpretation of the request. The paper envisions an innovative solution to process the above mentioned requests, limiting user involvement by ex-ploiting information on: (a) user context (geo-location, in-terests, needs); (b) data and processing quality; (c) similar requests repeated over time. By interpreting a request in a novel way by means of a Wearable Query (WQ), i.e., a query that captures the user and request specificities, we envision a methodological and technological solution for WQs in the presence of repeated information needs in distributed, het-erogeneous, dynamic environments, with emphasis on the geo-spatial dimension and on data quality. 1.
(Show Context)

Citation Context

...uire a processing time inadequate to the urgency of the request. The solution proposed in the literature, that is, a significant involvement of the user during the process of interpreting the request =-=[7]-=-, does not seem adequate in dynamic contexts, in which the interaction is often hampered by the communication means and by the need to quickly obtain answers. The paper envisages an approach for provi...

A Personal Perspective on Keyword Search over Data Graphs *

by Yehoshua Sagiv
"... ABSTRACT Theoretical and practical issues pertaining to keyword search over data graphs are discussed. A formal model and algorithms for enumerating answers (by operating directly on the data graph) are described. Various aspects of a system are explained, including the object-connector-property da ..."
Abstract - Add to MetaCart
ABSTRACT Theoretical and practical issues pertaining to keyword search over data graphs are discussed. A formal model and algorithms for enumerating answers (by operating directly on the data graph) are described. Various aspects of a system are explained, including the object-connector-property data model, how it is used to construct a data graph from an XML document, how to deal with redundancies in the source data, what are duplicate answers, implementation and GUI. An approach to ranking that combines textual relevance with semantic considerations is described. It is argued that search over data graphs is inherently a two-dimensional process, where the goal is not just to find particular content but also to collect information on how the desired data may be semantically connected.
(Show Context)

Citation Context

...hose keywordsshouldappeartogetherinthesamenodeofananswer. In [30], it is shown that these two features can enhance the performance (i.e., increase the MAP). 6. TWO DIMENSIONAL SEARCH As enunciated in =-=[28]-=-, keyword search over data graph is inherently a two-dimensional process. A specific answer is characterized by its content as well as by its pattern. The latter refers to the tree structure of an ans...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University