Yan Pingping, Qi Xiangming, Jiang Liang
Liaoning Technical University, School of Software, Huludao, Liaoning, China.
Tarim University, School of Information Engineering, Alar, Xinjiang, China.
PLoS One. 2025 May 23;20(5):e0321026. doi: 10.1371/journal.pone.0321026. eCollection 2025.
In the domain of remote sensing image small target detection, challenges such as difficulties in extracting features of small targets, complex backgrounds that easily lead to confusion with targets, and high computational complexity with significant resource consumption are prevalent. We propose a lightweight small target detection algorithm for remote sensing images that combines GSConv and PConv, named LI-YOLOv8. Using YOLOv8n as the baseline algorithm, the activation function SiLU in the CBS at the backbone network's SPPF is replaced with ReLU, which reduces interdependencies among parameters. Then, RFAConv is embedded after the first CBS to expand the receptive field and extract more features of small targets. An efficient Multi-Scale Attention (EMA) mechanism is embedded at the terminal of C2f within the neck network to integrate more detailed information, enhancing the focus on small targets. The head network incorporates a lightweight detection head, GP-Detect, which combines GSConv and PConv to decrease the parameter count and computational demand. Integrating Inner-IoU and Wise-IoU v3 to design the Inner-Wise IoU loss function, replacing the original CIoU loss function. This approach provides the algorithm with a gain distribution strategy, focuses on anchor boxes of ordinary quality, and strengthens generalization capability. We conducted ablation and comparative experiments on the public datasets RSOD and NWPU VHR-10. Compared to YOLOv8, our approach achieved improvements of 7.6% and 2.8% in mAP@0.5, and increases of 2.1% and 1.1% in mAP@0.5:0.95. Furthermore, Parameters and GFLOPs were reduced by 10.0% and 23.2%, respectively, indicating a significant enhancement in detection accuracy along with a substantial decrease in both parameters and computational costs. Generalization experiments were conducted on the TinyPerson, LEVIR-ship, brain-tumor, and smoke_fire_1 datasets. The mAP@0.5 metric improved by 2.6%, 5.3%, 2.6%, and 2.3%, respectively, demonstrating the algorithm's robust performance.
在遥感图像小目标检测领域,存在诸多挑战,如小目标特征提取困难、复杂背景易导致与目标混淆、计算复杂度高且资源消耗大等问题普遍存在。我们提出了一种用于遥感图像的轻量级小目标检测算法,该算法结合了GSConv和PConv,名为LI-YOLOv8。以YOLOv8n作为基线算法,将主干网络SPPF中CBS的激活函数SiLU替换为ReLU,减少了参数之间的相互依赖。然后,在第一个CBS之后嵌入RFAConv以扩大感受野并提取更多小目标特征。在颈部网络的C2f末端嵌入高效的多尺度注意力(EMA)机制,以整合更详细的信息,增强对小目标的关注。头部网络包含一个轻量级检测头GP-Detect,它结合了GSConv和PConv以减少参数数量和计算需求。整合Inner-IoU和Wise-IoU v3来设计Inner-Wise IoU损失函数,取代原来的CIoU损失函数。这种方法为算法提供了增益分配策略,关注普通质量的锚框,并增强了泛化能力。我们在公共数据集RSOD和NWPU VHR-10上进行了消融实验和对比实验。与YOLOv8相比,我们的方法在mAP@0.5上分别提高了7.6%和2.8%,在mAP@0.5:0.95上分别提高了2.1%和1.1%。此外,参数和GFLOPs分别减少了10.0%和23.2%,表明检测精度显著提高,同时参数和计算成本大幅降低。在TinyPerson、LEVIR-ship、脑肿瘤和smoke_fire_1数据集上进行了泛化实验。mAP@0.5指标分别提高了2.6%、5.3%、2.6%和2.3%,证明了该算法的稳健性能。