Characterization of Database Access Pattern for Analytic Prediction of Buffer Hit Probability (1995)
| Venue: | VLDB Journal |
| Citations: | 27 - 1 self |
BibTeX
@ARTICLE{Dan95characterizationof,
author = {Asit Dan and Philip S. Yu and Jen-yao Chung},
title = {Characterization of Database Access Pattern for Analytic Prediction of Buffer Hit Probability},
journal = {VLDB Journal},
year = {1995},
volume = {4},
pages = {127--154}
}
Years of Citing Articles
OpenURL
Abstract
Abstract. The analytic prediction of buffer hit probability, based on the charac-terization of database accesses from real reference traces, is extremely useful for workload management and system capacity planning. The knowledge can be help-ful for proper allocation of buffer space to various database relations, as well as for the management of buffer space for a mixed transaction and query environment. Access characterization can also be used to predict the buffer invalidation effect in a multi-node environment which, in turn, can influence transaction routing strate-gies. However, it is a challenge to characterize the database access pattern of a real workload reference trace in a simple manner that can easily be used to compute buffer hit probability. In this article, we use a characterization method that distin-guishes three types of access patterns from a trace: (1) locality within a transaction, (2) random accesses by transactions, and (3) sequential accesses by long queries. We then propose a concise way to characterize the access skew across randomly accessed pages by logically grouping the large number of data pages into a small number of partitions such that the frequency of accessing each page within a par-tition can be treated as equal. Based on this approach, we present a recursive binary partitioning algorithm that can infer the access skew characterization from the buffer hit probabilities for a subset of the buffer sizes. We validate the buffer hit predictions for single and multiple node systems using production database traces. We further show that the proposed approach can predict the buffer hit probability of a composite workload from those of its component files. Key Words. Database access characterization, access skew, sequential access, ref-erence trace, workload management, analytic prediction.







