Liu Bingqi, Mo Peijun, Wang Shengzhe, Cui Yuyong, Wu Zhongjian
Norla Institute of Technical Physics, Chengdu 610041, China.
School of Mechanical Engineering, Chengdu University, Chengdu 610106, China.
Sensors (Basel). 2024 Nov 8;24(22):7166. doi: 10.3390/s24227166.
Remote sensing object detection (RSOD) plays a crucial role in resource utilization, geological disaster risk assessment and urban planning. Deep learning-based object-detection algorithms have proven effective in remote sensing image studies. However, accurate detection of objects with small size, dense distribution and complex object arrangement remains a significant challenge in the remote sensing field. To address this, a refined and efficient object-detection algorithm (RE-YOLO) has been proposed in this paper for remote sensing images. Initially, a refined and efficient module (REM) was designed to balance computational complexity and feature-extraction capabilities, which serves as a key component of the RE_CSP block. RE_CSP block efficiently extracts multi-scale information, overcoming challenges posed by complex backgrounds. Moreover, the spatial extracted attention module (SEAM) has been proposed in the bottleneck of backbone to promote representative feature learning and enhance the semantic information capture. In addition, a three-branch path aggregation network (TBPAN) has been constructed as the neck network, which facilitates comprehensive fusion of shallow positional information and deep semantic information across different channels, enabling the network with a robust ability to capture contextual information. Extensive experiments conducted on two large-scale remote sensing datasets, DOTA-v1.0 and SCERL, demonstrate that the proposed RE-YOLO outperforms state-of-the-art other object-detection approaches and exhibits a significant improvement in generalization ability.
遥感目标检测(RSOD)在资源利用、地质灾害风险评估和城市规划中起着至关重要的作用。基于深度学习的目标检测算法在遥感图像研究中已被证明是有效的。然而,在遥感领域,准确检测小尺寸、密集分布和复杂物体排列的物体仍然是一个重大挑战。为了解决这个问题,本文针对遥感图像提出了一种改进的高效目标检测算法(RE-YOLO)。最初,设计了一个改进的高效模块(REM)来平衡计算复杂度和特征提取能力,它是RE_CSP模块的关键组成部分。RE_CSP模块有效地提取多尺度信息,克服了复杂背景带来的挑战。此外,在主干网络的瓶颈处提出了空间提取注意力模块(SEAM),以促进代表性特征学习并增强语义信息捕获。此外,构建了一个三分支路径聚合网络(TBPAN)作为颈部网络,它有助于跨不同通道全面融合浅层位置信息和深层语义信息,使网络具有强大的上下文信息捕获能力。在两个大规模遥感数据集DOTA-v1.0和SCERL上进行的大量实验表明,所提出的RE-YOLO优于其他先进的目标检测方法,并且在泛化能力方面有显著提高。