Authors
Maciej Jaworski, Piotr Duda, Leszek Rutkowski
Publication date
2017/5/10
Journal
IEEE transactions on neural networks and learning systems
Volume
29
Issue
6
Pages
2516-2529
Publisher
IEEE
Description
The most popular tools for stream data mining are based on decision trees. In previous 15 years, all designed methods, headed by the very fast decision tree algorithm, relayed on Hoeffding's inequality and hundreds of researchers followed this scheme. Recently, we have demonstrated that although the Hoeffding decision trees are an effective tool for dealing with stream data, they are a purely heuristic procedure; for example, classical decision trees such as ID3 or CART cannot be adopted to data stream mining using Hoeffding's inequality. Therefore, there is an urgent need to develop new algorithms, which are both mathematically justified and characterized by good performance. In this paper, we address this problem by developing a family of new splitting criteria for classification in stationary data streams and investigating their probabilistic properties. The new criteria, derived using appropriate statistical tools …
Total citations
201720182019202020212022202320242191828179128
Scholar articles
M Jaworski, P Duda, L Rutkowski - IEEE transactions on neural networks and learning …, 2017