Authors
Amit Shukla, Prasad Deshpande, Jeffrey F Naughton
Publication date
1998/8/24
Journal
VLDB
Volume
98
Pages
488-499
Description
To fulfill the requirement of fast interactive multidimensional data analysis, database systems precompute aggregate views on some subsets of dimensions and their corresponding hierarchies. However, the problem of what to precompute is difficult and intriguing. The leading existing algorithm, BPUS, has a running time that is polynomial in the number of views and is guaranteed to be within (0.63-f) of optimal, where f is the fraction of available space consumed by the largest aggregate. Unfortunately, BPUS can be impractically slow, and in some instances may miss good solutions due to the coarse granularity at which it makes its decisions of what to precompute. In view of this, we study the structure of the precomputation problem and show that under certain broad conditions on the multidimensional data, an even simpler and faster algorithm, PBS, achieves the same (0.63-f) bound. Our empirical study of the behavior of PBS shows that even when this condition does not hold, PBS picks a surprisingly good set of aggregates for precomputation. Furthermore, BPUS and other previous work has assumed that all aggregates are either precomputed in their entirety or not at all. We show that if one relaxes this and allows aggregates to be partially precomputed, not only is it possible to find solutions that are better than those found by previous algorithms, in some cases it is even possible to find solutions that are better than the solution that is ‘optimal’by the previous definition.
Total citations
19971998199920002001200220032004200520062007200820092010201120122013201420152016201720182019202020212022202320242213212623153117302718251623232510191614101153542
Scholar articles