Xu Yonghao, Ghamisi Pedram
IEEE Trans Image Process. 2022;31:5038-5051. doi: 10.1109/TIP.2022.3189825. Epub 2022 Aug 2.
Deep learning algorithms have obtained great success in semantic segmentation of very high-resolution (VHR) remote sensing images. Nevertheless, training these models generally requires a large amount of accurate pixel-wise annotations, which is very laborious and time-consuming to collect. To reduce the annotation burden, this paper proposes a consistency-regularized region-growing network (CRGNet) to achieve semantic segmentation of VHR remote sensing images with point-level annotations. The key idea of CRGNet is to iteratively select unlabeled pixels with high confidence to expand the annotated area from the original sparse points. However, since there may exist some errors and noises in the expanded annotations, directly learning from them may mislead the training of the network. To this end, we further propose the consistency regularization strategy, where a base classifier and an expanded classifier are employed. Specifically, the base classifier is supervised by the original sparse annotations, while the expanded classifier aims to learn from the expanded annotations generated by the base classifier with the region-growing mechanism. The consistency regularization is thereby achieved by minimizing the discrepancy between the predictions from both the base and the expanded classifiers. We find such a simple regularization strategy is yet very useful to control the quality of the region-growing mechanism. Extensive experiments on two benchmark datasets demonstrate that the proposed CRGNet significantly outperforms the existing state-of-the-art methods. Codes and pre-trained models are available online (https://github.com/YonghaoXu/CRGNet).
深度学习算法在超高分辨率(VHR)遥感图像的语义分割方面取得了巨大成功。然而,训练这些模型通常需要大量精确的逐像素标注,而收集这些标注非常费力且耗时。为减轻标注负担,本文提出一种一致性正则化区域生长网络(CRGNet),以利用点级标注实现VHR遥感图像的语义分割。CRGNet的关键思想是迭代选择具有高置信度的未标注像素,从原始稀疏点开始扩展标注区域。然而,由于扩展后的标注中可能存在一些错误和噪声,直接从它们进行学习可能会误导网络训练。为此,我们进一步提出一致性正则化策略,其中使用了一个基础分类器和一个扩展分类器。具体而言,基础分类器由原始稀疏标注监督,而扩展分类器旨在从通过区域生长机制由基础分类器生成的扩展标注中进行学习。通过最小化基础分类器和扩展分类器预测之间的差异来实现一致性正则化。我们发现这样一种简单的正则化策略对于控制区域生长机制的质量非常有用。在两个基准数据集上进行的大量实验表明,所提出的CRGNet显著优于现有的最先进方法。代码和预训练模型可在线获取(https://github.com/YonghaoXu/CRGNet)。