Results 1 -
2 of
2
Dynamic Maintenance of Data Distribution for Selectivity Estimation
- The VLDB Journal
, 1994
"... We propose a new dynamic method for multidimensional selectivity estimation for range queries that works accurately independent of data distribution. Good estimation of selectivity is important for query optimization and physical database design. Our method employs the Multilevel Grid File (MLGF) fo ..."
Abstract
-
Cited by 21 (9 self)
- Add to MetaCart
We propose a new dynamic method for multidimensional selectivity estimation for range queries that works accurately independent of data distribution. Good estimation of selectivity is important for query optimization and physical database design. Our method employs the Multilevel Grid File (MLGF) for accurate estimation of multidimensional data distribution. The MLGF is a dynamic hierarchical balanced multidimensional file structure that gracefully adapts to nonuniform and correlated distributions. We show that the MLGF directory naturally represents a multidimensional data distribution. We then extend it for further refinement and present the selectivity estimation method based on the MLGF. Extensive experiments have been performed to test the accuracy of selectivity estimation. The results show that estimation errors are very small independent of distributions even with correlated and/or highly-skewed ones. Finally, we analyze the cause of errors in estimation and investigate the eff...
(~)VLDB Dynamic Maintenance of Data Distribution for Selectivity Estimation
, 1991
"... Abstract. We propose a new dynamic method for multidimensional selectivity estimation for range queries that works accurately independent of data distribution. Good estimation of selectivity is important for query optimization and physical database design. Our method employs the multilevel grid file ..."
Abstract
- Add to MetaCart
Abstract. We propose a new dynamic method for multidimensional selectivity estimation for range queries that works accurately independent of data distribution. Good estimation of selectivity is important for query optimization and physical database design. Our method employs the multilevel grid file (MLGF) for accurate estimation of multidimensional data distribution. The MLGF is a dynamic, hierarchical, balanced, multidimensional file structure that gracefully adapts to nonuniform and correlated distributions. We show that the MLGF directory naturally represents a multidimensional data distribution. We then extend it for further refinement and present the selectivity estimation method based on the MLGE Extensive experiments have been performed to test the accuracy of selectivity estimation. The results show that estimation errors are very small independent of distributions, even with correlated and/or highly skewed ones. Finally, we analyze the cause of errors in estimation and investigate the effects of various parameters on the accuracy of estimation. Key Words. Query optimization, physical database design, multidimensional file structure, multilevel grid files. 1.

