Chang Fangzhe, Ma Tianyue, Wang Dantong, Zhu Shoujie, Li Dengping, Feng Shuntian, Fan Xiaoyong
Hebei Provincial Communication Planning, Design, and Research Institute Co., Ltd Town Renewal Research Center, Shijiazhuang, China.
School of Geoscience and Surveying Engineering, China University of Mining and Technology (Beijing), Beijing, China.
PLoS One. 2025 Mar 18;20(3):e0317106. doi: 10.1371/journal.pone.0317106. eCollection 2025.
To address challenges in remote sensing images, such as the abundance of buildings, difficulty in contour extraction, and slow update speeds, a high-resolution remote sensing image building segmentation and extraction method based on the YOLOv5ds network structure was proposed using Gaofen-2 images. This method, named YOLOv5ds-RC, comprises three primary components: target detection, semantic segmentation, and edge optimization. In the semantic segmentation module, an upsampling and multiple convolutional layers branch out from the second feature fusion layer of the Feature Pyramid Networks (FPN), producing a category mapping image that matches the original image size. For edge optimization, a Raster compression module is incorporated at the end of the segmentation network to refine the segmentation contours. This approach enables effective segmentation of Gaofen-2 images, achieving detailed results at the individual building scale across urban areas and facilitating rapid contour optimization and extraction. Experimental results indicate that YOLOv5ds-RC achieves an accuracy of 0.8849, a recall of 0.63904, an average precision (AP) at 0.5 of 0.75863, and a mean average precision (mAP) from 0.5 to 0.95 of 0.47388. These metrics significantly surpass those of the original YOLOv5ds, which recorded values of 0.81483 for accuracy, 0.51332 for recall, 0.63552 for AP at 0.5, and 0.34922 for mAP. The algorithm effectively corrects target displacement deviations in non-orthogonal images and achieves more objective and accurate contour extraction, meeting the requirements for rapid extraction. Due to these features, YOLOv5ds-RC can further enhance fully automated rapid extraction and historical change analysis in land use change monitoring.
为解决遥感影像中存在的诸如建筑物众多、轮廓提取困难以及更新速度缓慢等挑战,利用高分二号影像提出了一种基于YOLOv5ds网络结构的高分辨率遥感影像建筑物分割与提取方法。该方法名为YOLOv5ds-RC,由三个主要部分组成:目标检测、语义分割和边缘优化。在语义分割模块中,从特征金字塔网络(FPN)的第二个特征融合层分支出来一个上采样和多个卷积层,生成一个与原始图像大小匹配的类别映射图像。对于边缘优化,在分割网络的末尾加入了一个光栅压缩模块来细化分割轮廓。这种方法能够对高分二号影像进行有效分割,在城市区域的单个建筑物尺度上实现详细的结果,并有助于快速进行轮廓优化和提取。实验结果表明,YOLOv5ds-RC的准确率为0.8849,召回率为0.63904,0.5的平均精度(AP)为0.75863,0.5到0.95的平均平均精度(mAP)为0.47388。这些指标显著超过了原始YOLOv5ds的指标,原始YOLOv5ds的准确率为0.81483,召回率为0.51332,0.5的AP为0.63552,mAP为0.34922。该算法有效校正了非正交影像中的目标位移偏差,实现了更客观准确的轮廓提取,满足快速提取的要求。由于这些特性,YOLOv5ds-RC可以进一步增强土地利用变化监测中的全自动快速提取和历史变化分析。