College of Mathematics and Computer Science, Zhejiang A and F University, Hangzhou 311300, China.
Key Laboratory of Forestry Intelligent Monitoring and Information Technology of Zhejiang Province, Hangzhou 311300, China.
Sensors (Basel). 2022 Oct 2;22(19):7477. doi: 10.3390/s22197477.
Ground-object classification using remote-sensing images of high resolution is widely used in land planning, ecological monitoring, and resource protection. Traditional image segmentation technology has poor effect on complex scenes in high-resolution remote-sensing images. In the field of deep learning, some deep neural networks are being applied to high-resolution remote-sensing image segmentation. The DeeplabV3+ network is a deep neural network based on encoder-decoder architecture, which is commonly used to segment images with high precision. However, the segmentation accuracy of high-resolution remote-sensing images is poor, the number of network parameters is large, and the cost of training network is high. Therefore, this paper improves the DeeplabV3+ network. Firstly, MobileNetV2 network was used as the backbone feature-extraction network, and an attention-mechanism module was added after the feature-extraction module and the ASPP module to introduce focal loss balance. Our design has the following advantages: it enhances the ability of network to extract image features; it reduces network training costs; and it achieves better semantic segmentation accuracy. Experiments on high-resolution remote-sensing image datasets show that the mIou of the proposed method on WHDLD datasets is 64.76%, 4.24% higher than traditional DeeplabV3+ network mIou, and the mIou on CCF BDCI datasets is 64.58%. This is 5.35% higher than traditional DeeplabV3+ network mIou and outperforms traditional DeeplabV3+, U-NET, PSP-NET and MACU-net networks.
使用高分辨率遥感图像进行地物分类广泛应用于土地规划、生态监测和资源保护。传统的图像分割技术对高分辨率遥感图像中的复杂场景效果不佳。在深度学习领域,一些深度学习网络被应用于高分辨率遥感图像分割。DeeplabV3+ 网络是一种基于编解码器结构的深度学习网络,常用于高精度图像分割。但是,高分辨率遥感图像的分割精度较差,网络参数数量大,训练网络的成本高。因此,本文对 DeeplabV3+网络进行了改进。首先,使用 MobileNetV2 网络作为骨干特征提取网络,在特征提取模块和 ASPP 模块之后添加注意力机制模块,引入焦点损失平衡。我们的设计具有以下优点:增强了网络提取图像特征的能力;降低了网络训练成本;实现了更好的语义分割精度。在高分辨率遥感图像数据集上的实验表明,所提出的方法在 WHDLD 数据集上的 mIou 为 64.76%,比传统的 DeeplabV3+网络 mIou 高 4.24%,在 CCF BDCI 数据集上的 mIou 为 64.58%,比传统的 DeeplabV3+网络 mIou 高 5.35%,优于传统的 DeeplabV3+、U-NET、PSP-NET 和 MACU-net 网络。