Department of Geographic Information, Information Engineering University, Wutong Street High-Tech District, Zhengzhou 450001, China.
School of Computer Science & Technology, Beijing Institute of Technology, Haidian District, Beijing 100081, China.
Sensors (Basel). 2023 Jan 21;23(3):1258. doi: 10.3390/s23031258.
An appropriate detection network is required to extract building information in remote sensing images and to relieve the issue of poor detection effects resulting from the deficiency of detailed features. Firstly, we embed a transposed convolution sampling module fusing multiple normalization activation layers in the decoder based on the SegFormer network. This step alleviates the issue of missing feature semantics by adding holes and fillings, cascading multiple normalizations and activation layers to hold back over-fitting regularization expression and guarantee steady feature parameter classification. Secondly, the atrous spatial pyramid pooling decoding module is fused to explore multi-scale contextual information and to overcome issues such as the loss of detailed information on local buildings and the lack of long-distance information. Ablation experiments and comparison experiments are performed on the remote sensing image AISD, MBD, and WHU dataset. The robustness and validity of the improved mechanism are demonstrated by control groups of ablation experiments. In comparative experiments with the HRnet, PSPNet, U-Net, DeepLabv3+ networks, and the original detection algorithm, the mIoU of the AISD, the MBD, and the WHU dataset is enhanced by 17.68%, 30.44%, and 15.26%, respectively. The results of the experiments show that the method of this paper is superior to comparative methods such as U-Net. Furthermore, it is better for integrity detection of building edges and reduces the number of missing and false detections.
需要一个合适的检测网络来提取遥感图像中的建筑物信息,并缓解由于详细特征不足导致的检测效果不佳的问题。首先,我们在基于 SegFormer 网络的解码器中嵌入了一个转置卷积采样模块,融合了多个归一化激活层。通过添加空洞和填充,级联多个归一化和激活层来阻止过拟合正则化表达式,保证特征参数分类的稳定性,从而缓解特征语义缺失的问题。其次,融合了空洞空间金字塔池化解码模块来探索多尺度上下文信息,克服局部建筑物细节信息丢失和远距离信息缺乏等问题。在遥感图像 AISD、MBD 和 WHU 数据集上进行了消融实验和对比实验。通过消融实验的对照组证明了改进机制的稳健性和有效性。在与 HRnet、PSPNet、U-Net、DeepLabv3+网络和原始检测算法的对比实验中,AISD、MBD 和 WHU 数据集的 mIoU 分别提高了 17.68%、30.44%和 15.26%。实验结果表明,本文方法优于 U-Net 等对比方法,并且能够更好地进行建筑物边缘的完整性检测,减少漏检和误检的数量。