School of Geography and Information Engineering, China University of Geosciences, Wuhan 430078, China.
Academy of Computer, Huanggang Normal University, No. 146 Xinggang 2nd Road, Huanggang 438000, China.
Sensors (Basel). 2020 Nov 23;20(22):6699. doi: 10.3390/s20226699.
Imbalanced learning is a common problem in remote sensing imagery-based land-use and land-cover classifications. Imbalanced learning can lead to a reduction in classification accuracy and even the omission of the minority class. In this paper, an impartial semi-supervised learning strategy based on extreme gradient boosting (ISS-XGB) is proposed to classify very high resolution (VHR) images with imbalanced data. ISS-XGB solves multi-class classification by using several semi-supervised classifiers. It first employs multi-group unlabeled data to eliminate the imbalance of training samples and then utilizes gradient boosting-based regression to simulate the target classes with positive and unlabeled samples. In this study, experiments were conducted on eight study areas with different imbalanced situations. The results showed that ISS-XGB provided a comparable but more stable performance than most commonly used classification approaches (i.e., random forest (RF), XGB, multilayer perceptron (MLP), and support vector machine (SVM)), positive and unlabeled learning (PU-Learning) methods (PU-BP and PU-SVM), and typical synthetic sample-based imbalanced learning methods. Especially under extremely imbalanced situations, ISS-XGB can provide high accuracy for the minority class without losing overall performance (the average overall accuracy achieves 85.92%). The proposed strategy has great potential in solving the imbalanced classification problems in remote sensing.
不平衡学习是基于遥感图像的土地利用和土地覆盖分类中的一个常见问题。不平衡学习可能导致分类精度降低,甚至忽略少数类。本文提出了一种基于极端梯度提升(ISS-XGB)的公正半监督学习策略,用于对不平衡数据的高分辨率(VHR)图像进行分类。ISS-XGB 通过使用多个半监督分类器来解决多类分类问题。它首先使用多组未标记数据来消除训练样本的不平衡,然后利用基于梯度提升的回归来模拟正样本和未标记样本的目标类。本研究在八个具有不同不平衡情况的研究区域进行了实验。结果表明,ISS-XGB 的性能与最常用的分类方法(随机森林(RF)、XGB、多层感知机(MLP)和支持向量机(SVM))、正样本和未标记样本学习方法(PU-BP 和 PU-SVM)以及典型的基于合成样本的不平衡学习方法相当,但更稳定。特别是在极度不平衡的情况下,ISS-XGB 可以为少数类提供高精度,而不会损失整体性能(平均整体准确率达到 85.92%)。该策略在解决遥感中的不平衡分类问题方面具有很大的潜力。