Authors
Clifton Phua, Damminda Alahakoon, Vincent Lee
Publication date
2004/6/1
Journal
Acm sigkdd explorations newsletter
Volume
6
Issue
1
Pages
50-59
Publisher
ACM
Description
This paper proposes an innovative fraud detection method, built upon existing fraud detection research and Minority Report, to deal with the data mining problem of skewed data distributions. This method uses backpropagation (BP), together with naive Bayesian (NB) and C4.5 algorithms, on data partitions derived from minority oversampling with replacement. Its originality lies in the use of a single meta-classifier (stacking) to choose the best base classifiers, and then combine these base classifiers' predictions (bagging) to improve cost savings (stacking-bagging). Results from a publicly available automobile insurance fraud detection data set demonstrate that stacking-bagging performs slightly better than the best performing bagged algorithm, C4.5, and its best classifier, C4.5 (2), in terms of cost savings. Stacking-bagging also outperforms the common technique used in industry (BP without both sampling and …
Total citations
2005200620072008200920102011201220132014201520162017201820192020202120222023202476212127342840433747525253565038463924
Scholar articles
C Phua, D Alahakoon, V Lee - Acm sigkdd explorations newsletter, 2004