Authors
Dong Xin, Zheng Shao, Jiawei Han, Hongyan Liu
Publication date
2006/4/3
Conference
22nd International Conference on Data Engineering (ICDE'06)
Pages
4-4
Publisher
IEEE
Description
It is well recognized that data cubing often produces huge outputs. Two popular efforts devoted to this problem are (1) iceberg cube, where only significant cells are kept, and (2) closed cube, where a group of cells which preserve roll-up/drill-down semantics are losslessly compressed to one cell. Due to its usability and importance, efficient computation of closed cubes still warrants a thorough study. In this paper, we propose a new measure, called closedness, for efficient closed data cubing. We show that closedness is an algebraic measure and can be computed efficiently and incrementally. Based on closedness measure, we develop an an aggregation-based approach, called C-Cubing (i.e., Closed-Cubing), and integrate it into two successful iceberg cubing algorithms: MM-Cubing and Star-Cubing. Our performance study shows that C-Cubing runs almost one order of magnitude faster than the previous …
Total citations
2006200720082009201020112012201320142015201620172018201920202021202257615779423331311
Scholar articles
D Xin, Z Shao, J Han, H Liu - 22nd International Conference on Data Engineering …, 2006