Zhang PengLei, Liu Yanhong
Software College, Henan University, KaiFeng, Henan, China.
PeerJ Comput Sci. 2024 Apr 29;10:e2007. doi: 10.7717/peerj-cs.2007. eCollection 2024.
Uncrewed aerial vehicle (UAV) aerial photography technology is widely used in both industrial and military sectors, but remote sensing for small target detection still faces several challenges. Firstly, the small size of targets increases the difficulty of detection and recognition. Secondly, complex aerial environmental conditions, such as lighting changes and background noise, significantly affect the quality of detection. Rapid and accurate identification of target categories is also a key issue, requiring improvements in detection speed and accuracy. This study proposes an improved remote sensing target detection algorithm based on the YOLOv5 architecture. In the YOLOv5s model, the Distribution Focal Loss function is introduced to accelerate the convergence speed of the network and enhance the network's focus on annotated data. Simultaneously, adjustments are made to the Cross Stage Partial (CSP) network structure, modifying the convolution kernel size, adding a new stack-separated convolution module, and designing a new attention mechanism to achieve effective feature fusion between different hierarchical structure feature maps. Experimental results demonstrate a significant performance improvement of the proposed algorithm on the RSOD dataset, with a 3.5% increase in detection accuracy compared to the original algorithm. These findings indicate that our algorithm effectively enhances the precision of remote sensing target detection and holds potential application prospects.
无人机(UAV)航空摄影技术在工业和军事领域都有广泛应用,但用于小目标检测的遥感仍面临若干挑战。首先,目标尺寸小增加了检测和识别的难度。其次,复杂的空中环境条件,如光照变化和背景噪声,会显著影响检测质量。快速准确地识别目标类别也是一个关键问题,需要提高检测速度和准确性。本研究提出了一种基于YOLOv5架构的改进遥感目标检测算法。在YOLOv5s模型中,引入了分布焦点损失函数以加快网络收敛速度,并增强网络对标注数据的关注。同时,对跨阶段局部(CSP)网络结构进行调整,修改卷积核大小,添加新的堆叠分离卷积模块,并设计新的注意力机制,以实现不同层次结构特征图之间的有效特征融合。实验结果表明,该算法在RSOD数据集上的性能有显著提升,与原算法相比,检测准确率提高了3.5%。这些结果表明,我们的算法有效地提高了遥感目标检测的精度,并具有潜在的应用前景。