• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Pareto-based dominant graph: An efficient indexing structure to answer top-k queries (0)

by L Zou, L Chen
Venue:IEEE Trans. Knowl. Data Eng
Add To MetaCart

Tools

Sorted by:
Results 1 - 5 of 5

Reverse Top-k Queries

by Akrivi Vlachou, Christos Doulkeridis, Yannis Kotidis, Kjetil Nørvåg
"... Rank-aware query processing has become essential for many applications that return to the user only the top-k objects based on the individual user’s preferences. Top-k queries have been mainly studied from the perspective of the user, focusing primarily on efficient query processing. In this work, ..."
Abstract - Cited by 9 (3 self) - Add to MetaCart
Rank-aware query processing has become essential for many applications that return to the user only the top-k objects based on the individual user’s preferences. Top-k queries have been mainly studied from the perspective of the user, focusing primarily on efficient query processing. In this work, for the first time, we study top-k queries from the perspective of the product manufacturer. Given a potential product, which are the user preferences for which this product is in the topk query result set? We identify a novel query type, namely reverse top-k query, that is essential for manufacturers to assess the potential market and impact of their products based on the competition. We formally define reverse top-k queries and introduce two versions of the query, namely monochromatic and bichromatic. We first provide a geometric interpretation of the monochromatic reverse top-k query in the solution space that helps to understand the reverse top-k query conceptually. Then, we study in more details the case of bichromatic reverse topk query, which is more interesting for practical applications. Such a query, if computed in a straightforward manner, requires evaluating a top-k query for each user preference in the database, which is prohibitively expensive even for moderate datasets. In this paper, we present an efficient threshold-based algorithm that eliminates candidate user preferences, without processing the respective top-k queries. Furthermore, we introduce an indexing structure based on materialized reverse top-k views in order to speed up the computation of reverse top-k queries. Materialized reverse top-k views trade preprocessing cost for query speed up in a controllable manner. Our experimental evaluation demonstrates the efficiency of our techniques, which reduce the required number of top-k computations by 1 to 3 orders of magnitude.

1 Monochromatic and Bichromatic Reverse Top-k Queries

by Akrivi Vlachou, Christos Doulkeridis, Yannis Kotidis
"... Abstract—Nowadays, most applications return to the user a limited set of ranked results based on the individual user’s preferences, which are commonly expressed through top-k queries. From the perspective of a manufacturer, it is imperative that her products appear in the highest ranked positions fo ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Abstract—Nowadays, most applications return to the user a limited set of ranked results based on the individual user’s preferences, which are commonly expressed through top-k queries. From the perspective of a manufacturer, it is imperative that her products appear in the highest ranked positions for many different user preferences, otherwise the product is not visible to potential customers. In this paper, we define a novel query type, namely the reverse top-k query, that covers this requirement: ”Given a potential product, which are the user preferences that make this product belong to the top-k query result set?”. Reverse top-k queries are essential for manufacturers to assess the impact of their products in the market based on the competition. We formally define reverse top-k queries and introduce two versions of the query, monochromatic and bichromatic. First, we provide a geometric interpretation of the monochromatic reverse topk query to acquire an intuition of the solution space. Then, we study in detail the case of bichromatic reverse top-k query, and we propose two techniques for query processing, namely an efficient thresholdbased algorithm and an algorithm based on materialized reverse topk views. Our experimental evaluation demonstrates the efficiency of our techniques. Index Terms—reverse top-k query, top-k query, user preferences 1

(will be inserted by the editor) Efficient Processing of Exact Top-k Queries over Sorted Lists

by Hwee Hwa Pang, Xuhua Ding, Baihua Zheng, Hweehwa Pang, Xuhua Ding, Baihua Zheng
"... the date of receipt and acceptance should be inserted later Abstract The top-k query is employed in a wide range of applications to generate a ranked list of data that have the highest aggregate scores over certain attributes. As the pool of attributes for selection by individual queries may be larg ..."
Abstract - Add to MetaCart
the date of receipt and acceptance should be inserted later Abstract The top-k query is employed in a wide range of applications to generate a ranked list of data that have the highest aggregate scores over certain attributes. As the pool of attributes for selection by individual queries may be large, the data are indexed with perattribute sorted lists, and a threshold algorithm is applied on the lists involved in each query. The threshold algorithm executes in two phases – find a cut-off threshold for the top-k result scores, then evaluate all the records that could score above the threshold. In this paper, we focus on exact top-k queries that involve monotonic linear scoring functions over diskresident sorted lists. We introduce a model for estimating the depths to which each sorted list needs to be processed

Answering Top-k Queries Over a Mixture of Attractive and Repulsive Dimensions ABSTRACT

by Sayan Ranu
"... In this paper, we formulate a top-k query that compares objects in a database to a user-provided query object on a novel scoring function. The proposed scoring function combines the idea of attractive and repulsive dimensions into a general framework to overcome the weakness of traditional distance ..."
Abstract - Add to MetaCart
In this paper, we formulate a top-k query that compares objects in a database to a user-provided query object on a novel scoring function. The proposed scoring function combines the idea of attractive and repulsive dimensions into a general framework to overcome the weakness of traditional distance or similarity measures. We study the properties of the proposed class of scoring functions and develop efficient and scalable index structures that index the isolines of the function. We demonstrate various scenarios where the query finds application. Empirical evaluation demonstrates a performance gain of one to two orders of magnitude on querying time over existing state-of-the-art top-k techniques. Further, a qualitative analysis is performed on a real dataset to highlight the potential of the proposed query in discovering hidden data characteristics. 1.

Efficient and Domain-Invariant Competitor Mining

by George Valkanas, Dimitrios Gunopulos
"... In any competitive business, success is based on the ability to make an item more appealing to customers than the competition. A number of questions arise in the context of this task: how do we formalize and quantify the competitiveness relationship between two items? Who are the true competitors of ..."
Abstract - Add to MetaCart
In any competitive business, success is based on the ability to make an item more appealing to customers than the competition. A number of questions arise in the context of this task: how do we formalize and quantify the competitiveness relationship between two items? Who are the true competitors of a given item? What are the features of an item that most affect its competitiveness? Despite the impact and relevance of this problem to many domains, only a limited amount of work has been devoted toward an effective solution. In this paper, we present a formal definition of the competitiveness between two items. We present efficient methods for evaluating competitiveness in large datasets and address the natural problem of finding the top-k competitors of a given item. Our methodology is evaluated a-gainst strong baselines via a user study and experiments on multiple datasets from different domains.
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University