College of Computer and Information, Hohai University, Nanjing, Jiangsu, China.
Water History Department, China Institute of Water Resources and Hydropower Research, Beijing, China.
PLoS One. 2024 May 14;19(5):e0301134. doi: 10.1371/journal.pone.0301134. eCollection 2024.
Land cover classification (LCC) is of paramount importance for assessing environmental changes in remote sensing images (RSIs) as it involves assigning categorical labels to ground objects. The growing availability of multi-source RSIs presents an opportunity for intelligent LCC through semantic segmentation, offering a comprehensive understanding of ground objects. Nonetheless, the heterogeneous appearances of terrains and objects contribute to significant intra-class variance and inter-class similarity at various scales, adding complexity to this task. In response, we introduce SLMFNet, an innovative encoder-decoder segmentation network that adeptly addresses this challenge. To mitigate the sparse and imbalanced distribution of RSIs, we incorporate selective attention modules (SAMs) aimed at enhancing the distinguishability of learned representations by integrating contextual affinities within spatial and channel domains through a compact number of matrix operations. Precisely, the selective position attention module (SPAM) employs spatial pyramid pooling (SPP) to resample feature anchors and compute contextual affinities. In tandem, the selective channel attention module (SCAM) concentrates on capturing channel-wise affinity. Initially, feature maps are aggregated into fewer channels, followed by the generation of pairwise channel attention maps between the aggregated channels and all channels. To harness fine-grained details across multiple scales, we introduce a multi-level feature fusion decoder with data-dependent upsampling (MLFD) to meticulously recover and merge feature maps at diverse scales using a trainable projection matrix. Empirical results on the ISPRS Potsdam and DeepGlobe datasets underscore the superior performance of SLMFNet compared to various state-of-the-art methods. Ablation studies affirm the efficacy and precision of SAMs in the proposed model.
土地覆盖分类(LCC)对于评估遥感图像(RSIs)中的环境变化至关重要,因为它涉及到将类别标签分配给地面物体。多源 RSIs 的日益普及为通过语义分割进行智能 LCC 提供了机会,从而可以全面了解地面物体。然而,地形和物体的异质性导致在不同尺度上存在显著的类内方差和类间相似性,这增加了任务的复杂性。为了解决这个问题,我们引入了 SLMFNet,这是一种创新的编码器-解码器分割网络。为了减轻 RSIs 的稀疏和不平衡分布,我们引入了选择性注意模块(SAM),旨在通过通过紧凑的矩阵运算在空间和通道域内整合上下文亲和力,来增强学习表示的可区分性。具体来说,选择性位置注意模块(SPAM)使用空间金字塔池化(SPP)对特征锚点进行重采样,并计算上下文亲和力。同时,选择性通道注意模块(SCAM)专注于捕获通道间的亲和力。最初,特征图被聚合到较少的通道中,然后在聚合通道和所有通道之间生成成对的通道注意力图。为了在多个尺度上利用细粒度的细节,我们引入了一个具有数据依赖上采样的多级特征融合解码器(MLFD),使用可训练的投影矩阵来精细地恢复和合并不同尺度的特征图。在 ISPRS Potsdam 和 DeepGlobe 数据集上的实验结果表明,SLMFNet 优于各种最先进的方法。消融研究证实了 SAM 在提出的模型中的有效性和精度。