Efficient Skyline and Top-k Retrieval in Subspaces
Cached
Download Links
- [www.cse.cuhk.edu.hk]
- [www.cse.cuhk.edu.hk]
- [www.cs.sfu.ca]
- DBLP
Other Repositories/Bibliography
| Citations: | 12 - 0 self |
BibTeX
@MISC{Tao_efficientskyline,
author = {Yufei Tao and Xiaokui Xiao and Jian Pei},
title = {Efficient Skyline and Top-k Retrieval in Subspaces},
year = {}
}
OpenURL
Abstract
Skyline and top-k queries are two popular operations for preference retrieval. In practice, applications that require these operations usually provide numerous candidate attributes, whereas, depending on their interests, users may issue queries regarding different subsets of the dimensions. The existing algorithms are inadequate for subspace skyline/top-k search because they have at least one of the following defects: they (i) require scanning the entire database at least once; (ii) are optimized for one subspace but incur significant overhead for other subspaces; (iii) demand expensive maintenance cost or space consumption. In this paper, we propose a technique, SUBSKY, which settles both types of queries using purely relational technologies. The core of SUBSKY is a transformation that converts multidimensional data to 1D values. These values are indexed by a simple B-tree, which allows us to answer subspace queries by accessing a fraction of the database. SUBSKY entails low maintenance overhead, which equals the cost of updating a traditional B-tree. Extensive experiments with real data confirm that our technique outperforms alternative solutions significantly in both efficiency and scalability.







