Improved Data Partitioning for Building Large ROLAP Data Cubes in Parallel
 Journal of Data Warehousing and Mining
, 2006
The precomputation of data cubes is critical to improving the response time of On-Line Analytical Processing (OLAP) systems and can be instrumental in accelerating data mining tasks in large data warehouses. However, as the size of data warehouses grows, the time it takes to perform this precomputation becomes a significant performance bottleneck.
Cited by 3
Building Large ROLAP Data Cubes in Parallel
 IDEAS
, 2004
The precomputation of data cubes is critical to improving the response time of On-Line Analytical Processing (OLAP) systems and can be instrumental in accelerating data mining tasks in large data warehouses. However, as the size of data warehouses grows, the time it takes to perform this precomputation becomes a significant performance bottleneck.
Cited by 5
This paper presents a fast parallel method for generating ROLAP data cubes on a shared-nothing multiprocessor based on a novel optimized data partitioning technique. Since no shared disk is required, this method can be applied on highly scalable processor
Implementing data cubes efficiently
 In SIGMOD
, 1996
Decision support applications involve complex queries on very large databases. Since response times should be small, query optimization is critical. Users typically view the data as multidimensional data cubes. Each cell of the data cube is a view consisting of an aggregation of interest, like total sales.
Cited by 545
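To make the "cell as an aggregation" idea concrete, here is a minimal sketch, not taken from the paper, that materializes every group-by view of a tiny fact table and stores total sales per cell. The table contents, the dimension names (`part`, `supplier`, `customer`), and the function `build_cube` are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical fact table: (part, supplier, customer, sales).
ROWS = [
    ("p1", "s1", "c1", 10),
    ("p1", "s2", "c1", 20),
    ("p2", "s1", "c2", 5),
    ("p2", "s1", "c1", 7),
]
DIMS = ("part", "supplier", "customer")


def build_cube(rows, dims):
    """Return {view: {group_key: total_sales}} for every subset of dims."""
    cube = {}
    for k in range(len(dims) + 1):
        for idx in combinations(range(len(dims)), k):
            view = tuple(dims[i] for i in idx)
            totals = defaultdict(int)
            for row in rows:
                # Project the row onto the view's dimensions and add its sales.
                key = tuple(row[i] for i in idx)
                totals[key] += row[-1]
            cube[view] = dict(totals)
    return cube


cube = build_cube(ROWS, DIMS)
```

With three dimensions the sketch produces 2^3 = 8 views, from the empty view `()` (the grand total) up to the full `("part", "supplier", "customer")` view. It rescans the fact table for each view, which is the naive cost that precomputation strategies aim to reduce.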
Parallel ROLAP Data Cubes Performance Comparison and Analysis
, 2003
The precomputation of data cubes is critical to improving the response time of On-Line Analytical Processing (OLAP) systems and can be instrumental in accelerating data mining tasks in large data warehouses. In order to meet the need for improved performance created by growing data sizes, parallel processing techniques have been developed.
Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks
 In EuroSys
, 2007
Dryad is a general-purpose distributed execution engine for coarse-grain data-parallel applications. A Dryad application combines computational "vertices" with communication "channels" to form a dataflow graph. Dryad runs the application by executing the vertices of this graph on a set of available computers.
Cited by 730
The cgmCUBE Project: Optimizing Parallel Data Cube Generation for ROLAP
Online Analytical Processing (OLAP) has become one of the most powerful and prominent technologies for knowledge discovery in VLDB (Very Large Database) environments. Central to the OLAP paradigm is the data cube, a multidimensional hierarchy of aggregate values that provides a rich analytical model.
Cited by 11
thorough optimization of the underlying sequential cube construction method and (2) a detailed and carefully engineered cost model for improved parallel load balancing and faster sequential cube construction. These optimizations were key in allowing us to build a prototype that is able to produce data
Data Security
, 1979
The rising abuse of computers and increasing threat to personal privacy through data banks have stimulated much interest in the technical safeguards for data. There are four kinds of safeguards, each related to but distinct from the others. Access controls regulate which users may enter the system and what resources they may use.
Cited by 611
Top-Down Computation of Partial ROLAP Data Cubes
, 2004
The precomputation of the different summary views of a data cube is critical to improving the response time of data cube queries for On-Line Analytical Processing (OLAP). The computation of the full data cube, representing all 2^n views, has been studied extensively. However, the full cube is often too large to compute and store.
Cited by 2
top-down computation of partial ROLAP data cubes. We present both sequential and parallel methods for top-down partial data cube construction. Our experimental results indicate close to linear performance improvement for partial data cube computation. For example, when selecting 50% of the views our method requires
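The idea of building only some of the 2^n views, top-down, can be illustrated with a small sketch. This is a generic illustration under stated assumptions, not the method of the paper: each selected view is rolled up from the smallest already-computed view that contains all of its dimensions. The helper names (`aggregate`, `partial_cube`), the base view, and its contents are hypothetical.

```python
from collections import defaultdict

# Hypothetical base view: total sales per (part, supplier, customer).
BASE_VIEW = ("part", "supplier", "customer")
BASE_CELLS = {
    ("p1", "s1", "c1"): 10,
    ("p1", "s2", "c1"): 20,
    ("p2", "s1", "c2"): 5,
    ("p2", "s1", "c1"): 7,
}


def aggregate(parent_view, parent_cells, child_view):
    """Roll a parent view up to a child view whose dimensions are a subset."""
    idx = [parent_view.index(d) for d in child_view]
    out = defaultdict(int)
    for key, total in parent_cells.items():
        out[tuple(key[i] for i in idx)] += total
    return dict(out)


def partial_cube(base_view, base_cells, selected):
    """Compute only the selected views, widest first.

    Each view is derived from the computed superset view with the fewest
    cells (an assumed, simplistic cost measure).
    """
    computed = {base_view: base_cells}
    for view in sorted(selected, key=len, reverse=True):
        if view in computed:
            continue
        parents = [v for v in computed if set(view) <= set(v)]
        parent = min(parents, key=lambda v: len(computed[v]))
        computed[view] = aggregate(parent, computed[parent], view)
    return {v: computed[v] for v in selected}


SELECTED = [("part", "supplier"), ("part",), ()]
result = partial_cube(BASE_VIEW, BASE_CELLS, SELECTED)
```

Here `("part",)` is derived from the three-cell `("part", "supplier")` view rather than from the four-cell base view, and the grand total `()` is derived from `("part",)`. A real system would use a more careful cost model than the cell count.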
Models and issues in data stream systems
 In PODS
, 2002
In this overview paper we motivate the need for and research issues arising from a new model of data processing. In this model, data does not take the form of persistent relations, but rather arrives in multiple, continuous, rapid, time-varying data streams. In addition to reviewing past work relevant to data stream processing, we describe applications and research issues.
Cited by 770
Fast Parallel Algorithms for Short-Range Molecular Dynamics
 Journal of Computational Physics
, 1995
Three parallel algorithms for classical molecular dynamics are presented. The first assigns each processor a fixed subset of atoms; the second assigns each a fixed subset of interatomic forces to compute; the third assigns each a fixed spatial region. The algorithms are suitable for molecular dynamics models which can be difficult to parallelize efficiently.
Cited by 622
dynamics models which can be difficult to parallelize efficiently: those with short-range forces where the neighbors of each atom change rapidly. They can be implemented on any distributed-memory parallel machine which allows for message-passing of data between independently executing processors
