Authors
Zhi-Hua Zhou, Xu-Ying Liu
Publication date
2006/1
Journal
IEEE Trans. Knowledge and Data Engineering
Volume
18
Issue
1
Pages
63-77
Publisher
IEEE
Description
This paper studies empirically the effect of sampling and threshold-moving in training cost-sensitive neural networks. Both oversampling and undersampling are considered. These techniques modify the distribution of the training data such that the costs of the examples are conveyed explicitly by the appearances of the examples. Threshold-moving tries to move the output threshold toward inexpensive classes such that examples with higher costs become harder to be misclassified. Moreover, hard-ensemble and soft-ensemble, i.e., the combination of above techniques via hard or soft voting schemes, are also tested. Twenty-one UCl data sets with three types of cost matrices and a real-world cost-sensitive data set are used in the empirical study. The results suggest that cost-sensitive learning with multiclass tasks is more difficult than with two-class tasks, and a higher degree of class imbalance may increase the …
Total citations
200620072008200920102011201220132014201520162017201820192020202120222023202471933454758558886661009112813217314514812363
Scholar articles