Liu Ping, Gao Yu, Zheng Xiangtian, Wang Hesong, Zhao Yimeng, Wu Xinru, Lu Zehao, Yue Zhichuan, Xie Yuting, Hao Shufeng
College of Artificial Intelligence, Taiyuan University of Technology, Jinzhong, Shanxi, China.
Business School of Northeast Normal University, Northeast Normal University, Changchun, Jilin, China.
Front Neurorobot. 2025 Jan 14;18:1482051. doi: 10.3389/fnbot.2024.1482051. eCollection 2024.
Accurate building segmentation has become critical in various fields such as urban management, urban planning, mapping, and navigation. With the increasing diversity in the number, size, and shape of buildings, convolutional neural networks have been used to segment and extract buildings from such images, resulting in increased efficiency and utilization of image features. We propose a building semantic segmentation method to improve the traditional Unet convolutional neural network by integrating attention mechanism and boundary detection. The attention mechanism module combines attention in the channel and spatial dimensions. The module captures image feature information in the channel dimension using a one-dimensional convolutional cross-channel method and automatically adjusts the cross-channel dimension using adaptive convolutional kernel size. Additionally, a weighted boundary loss function is designed to replace the traditional semantic segmentation cross-entropy loss to detect the boundary of a building. The loss function optimizes the extraction of building boundaries in backpropagation, ensuring the integrity of building boundary extraction in the shadow part. Experimental results show that the proposed model AMBDNet achieves high-performance metrics, including a recall rate of 0.9046, an IoU of 0.7797, and a pixel accuracy of 0.9140 on high-resolution remote sensing images, demonstrating its robustness and effectiveness in precise building segmentation. Experimental results further indicate that AMBDNet improves the single-class recall of buildings by 0.0322 and the single-class pixel accuracy by 0.0169 in the high-resolution remote sensing image recognition task.
精确的建筑物分割在城市管理、城市规划、测绘和导航等各个领域已变得至关重要。随着建筑物数量、大小和形状的多样性不断增加,卷积神经网络已被用于从此类图像中分割和提取建筑物,从而提高了图像特征的效率和利用率。我们提出了一种建筑物语义分割方法,通过整合注意力机制和边界检测来改进传统的Unet卷积神经网络。注意力机制模块在通道和空间维度上结合了注意力。该模块使用一维卷积跨通道方法在通道维度上捕获图像特征信息,并使用自适应卷积核大小自动调整跨通道维度。此外,设计了加权边界损失函数来替代传统的语义分割交叉熵损失,以检测建筑物的边界。该损失函数在反向传播中优化建筑物边界的提取,确保阴影部分建筑物边界提取的完整性。实验结果表明,所提出的模型AMBDNet在高分辨率遥感图像上实现了高性能指标,包括召回率为0.9046、交并比为0.7797和像素准确率为0.9140,证明了其在精确建筑物分割中的鲁棒性和有效性。实验结果进一步表明,在高分辨率遥感图像识别任务中,AMBDNet将建筑物的单类召回率提高了0.0322,单类像素准确率提高了0.0169。