Authors
Matthew Eric Otey, Srinivasan Parthasarathy, Chao Wang, Adriano Veloso, Wagner Meira
Publication date
2004/11/15
Journal
IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)
Volume
34
Issue
6
Pages
2439-2450
Publisher
IEEE
Description
Traditional methods for data mining typically make the assumption that the data is centralized, memory-resident, and static. This assumption is no longer tenable. Such methods waste computational and input/output (I/O) resources when data is dynamic, and they impose excessive communication overhead when data is distributed. Efficient implementation of incremental data mining methods is, thus, becoming crucial for ensuring system scalability and facilitating knowledge discovery when data is dynamic and distributed. In this paper, we address this issue in the context of the important task of frequent itemset mining. We first present an efficient algorithm which dynamically maintains the required information even in the presence of data updates without examining the entire dataset. We then show how to parallelize this incremental algorithm. We also propose a distributed asynchronous algorithm, which imposes …
Total citations
200420052006200720082009201020112012201320142015201620172018201920202021202220231261024564254532421
Scholar articles
ME Otey, S Parthasarathy, C Wang, A Veloso, W Meira - IEEE Transactions on Systems, Man, and Cybernetics …, 2004