A Novel Neighborhood-Weighted Sampling Method for Imbalanced Datasets

GUANG Mingjian; YAN Chungang; LIU Guanjun; WANG Junli; JIANG Changjun

doi:10.1049/cje.2021.00.121

Citation: GUANG Mingjian, YAN Chungang, LIU Guanjun, WANG Junli, JIANG Changjun. A Novel Neighborhood-Weighted Sampling Method for Imbalanced Datasets[J]. Chinese Journal of Electronics, 2022, 31(5): 969-979. DOI: 10.1049/cje.2021.00.121

Abstract

Weighted sampling methods based on k-nearest neighbors have proven effective for the class imbalance problem. However, when calculating a sample's weight, they usually ignore the positional relationship between the sample and the heterogeneous samples in its neighborhood. This paper proposes a novel neighborhood-weighted sampling method, named NWBBagging, to improve the performance of the Bagging algorithm on imbalanced datasets. When identifying critical samples, it considers the positional relationship between the center sample and the heterogeneous samples in its neighborhood. In addition, a parameter reduction method is proposed and integrated into the ensemble learning framework, which reduces the number of parameters and increases classifier diversity. We compare NWBBagging with several state-of-the-art ensemble learning algorithms on 34 imbalanced datasets, and the results show that NWBBagging achieves better performance.
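The general idea of neighborhood-weighted sampling for Bagging can be illustrated with a minimal sketch. The abstract does not state the weight formula used by NWBBagging, so the scoring rule below (each heterogeneous neighbor contributes 1 / (1 + distance), so that closer heterogeneous neighbors count more) is an assumption chosen only for illustration, as are the function names `neighborhood_weights` and `weighted_bootstrap` and the parameters `k` and `eps`. It should not be read as the authors' method.

```python
import math
import random


def neighborhood_weights(X, y, k=3):
    """Assign each sample a weight from the heterogeneous samples among its k nearest neighbors."""
    weights = []
    for i, xi in enumerate(X):
        # Distances from sample i to every other sample, nearest first.
        dists = sorted((math.dist(xi, xj), j) for j, xj in enumerate(X) if j != i)
        neighbors = dists[:k]
        # Assumed scoring rule (NOT taken from the paper): a neighbor of a
        # different class adds 1 / (1 + distance), so position matters.
        w = sum(1.0 / (1.0 + d) for d, j in neighbors if y[j] != y[i])
        weights.append(w)
    return weights


def weighted_bootstrap(X, y, weights, size, rng, eps=1e-6):
    """Draw a bootstrap sample with probability proportional to weight + eps."""
    # eps keeps zero-weight samples selectable and avoids an all-zero weight vector.
    probs = [w + eps for w in weights]
    idx = rng.choices(range(len(X)), weights=probs, k=size)
    return [X[i] for i in idx], [y[i] for i in idx]


if __name__ == "__main__":
    X = [(0.0, 0.0), (1.0, 0.0), (10.0, 0.0), (11.0, 0.0), (12.0, 0.0)]
    y = [0, 1, 0, 0, 0]
    w = neighborhood_weights(X, y, k=1)
    Xb, yb = weighted_bootstrap(X, y, w, size=10, rng=random.Random(0))
    print(w)
    print(yb)
```

In a Bagging setup, one such bootstrap sample would be drawn per base classifier, so samples near the class boundary appear more often in the training subsets. Here the weights favor the two samples whose nearest neighbor belongs to the other class.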