View article

[PDF] from ucsb.edu

Summarizing itemset patterns: a profile-based approach

Authors

Xifeng Yan, Hong Cheng, Jiawei Han, Dong Xin

Publication date

2005/8/21

Book

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining

Pages

314-323

Description

Frequent-pattern mining has been studied extensively on scalable methods for mining various kinds of patterns including itemsets, sequences, and graphs. However, the bottleneck of frequent-pattern mining is not at the efficiency but at the interpretability, due to the huge number of patterns generated by the mining process.In this paper, we examine how to summarize a collection of itemset patterns using only K representatives, a small number of patterns that a user can handle easily. The K representatives should not only cover most of the frequent patterns but also approximate their supports. A generative model is built to extract and profile these representatives, under which the supports of the patterns can be easily recovered without consulting the original dataset. Based on the restoration error, we propose a quality measure function to determine the optimal value of parameter K. Polynomial time algorithms are …

Total citations

Cited by 279

200520062007200820092010201120122013201420152016201720182019202020212022202320242 18 24 17 25 27 21 20 16 26 15 11 14 13 10 7 4 8 1

Scholar articles

Summarizing itemset patterns: a profile-based approach

X Yan, H Cheng, J Han, D Xin - Proceedings of the eleventh ACM SIGKDD …, 2005