Authors
Josey Mathew, Ming Luo, Chee Khiang Pang, Hian Leng Chan
Publication date
2015/11/9
Conference
IECON 2015-41st Annual Conference of the IEEE Industrial Electronics Society
Pages
001127-001132
Publisher
IEEE
Description
Datasets with an imbalanced class distribution pose a severe challenge to traditional learning algorithms that are designed to improve overall classification accuracy. Preprocessing methods like Synthetic Minority Over-sampling Technique (SMOTE) address this problem by generating data points in the input space to balance the training dataset. However, such artificial sampling methods can distort the performance of Support Vector Machine (SVM) classifiers that operate in a kernel induced feature space. This paper proposes a kernel-based SMOTE (K-SMOTE) algorithm that directly generates synthetically minority data points in the feature space of SVM classifier. The new data points are added by augmenting the original Gram matrix based on neighbourhood information in the feature space. The proposed algorithm is statistically shown to improve performance on 51 benchmark datasets. K-SMOTE is further …
Total citations
20162017201820192020202120222023202423109172117149
Scholar articles
J Mathew, M Luo, CK Pang, HL Chan - IECON 2015-41st Annual Conference of the IEEE …, 2015