Luo Hao, Wei Jiangshu, Wang Yuchao, Chen Jinrong, Li Wujie
College of Information Engineering, Sichuan Agricultural University, Ya'an, Sichuan, China.
College of Mechanical and Electrical Engineering, Sichuan Agricultural University, Ya'an, Sichuan, China.
PeerJ Comput Sci. 2024 Jan 30;10:e1830. doi: 10.7717/peerj-cs.1830. eCollection 2024.
Object detection based on deep learning has made great progress in the past decade and has been widely used in various fields of daily life. Model lightweighting is the core of deploying target detection models on mobile or edge devices. Lightweight models have fewer parameters and lower computational costs, but are often accompanied by lower detection accuracy. Based on YOLOv5s, this article proposes an improved lightweight target detection model, which can achieve higher detection accuracy with smaller parameters. Firstly, utilizing the lightweight feature of the Ghost module, we integrated it into the C3 structure and replaced some of the C3 modules after the upsample layer on the neck network, thereby reducing the number of model parameters and expediting the model's inference process. Secondly, the coordinate attention (CA) mechanism was added to the neck to enhance the model's ability to pay attention to relevant information and improved detection accuracy. Finally, a more efficient Simplified Spatial Pyramid Pooling-Fast (SimSPPF) module was designed to enhance the stability of the model and shorten the training time of the model. In order to verify the effectiveness of the improved model, experiments were conducted using three datasets with different features. Experimental results show that the number of parameters of our model is significantly reduced by 28% compared with the original model, and mean average precision (mAP) is increased by 3.1%, 1.1% and 1.8% respectively. The model also performs better in terms of accuracy compared to existing lightweight state-of-the-art models. On three datasets with different features, mAP of the proposed model achieved 87.2%, 77.8% and 92.3%, which is better than YOLOv7tiny (81.4%, 77.7%, 90.3%), YOLOv8n (84.7%, 77.7%, 90.6%) and other advanced models. When achieving the decreased number of parameters, the improved model can successfully increase mAP, providing great reference for deploying the model on mobile or edge devices.
在过去十年中,基于深度学习的目标检测取得了巨大进展,并已广泛应用于日常生活的各个领域。模型轻量化是在移动或边缘设备上部署目标检测模型的核心。轻量化模型具有较少的参数和较低的计算成本,但往往伴随着较低的检测精度。本文基于YOLOv5s提出了一种改进的轻量化目标检测模型,该模型能够以更少的参数实现更高的检测精度。首先,利用Ghost模块的轻量化特性,将其集成到C3结构中,并替换了颈部网络上采样层后的部分C3模块,从而减少了模型参数数量,加快了模型的推理过程。其次,在颈部添加了坐标注意力(CA)机制,以增强模型关注相关信息的能力,提高检测精度。最后,设计了一个更高效的简化空间金字塔池化快速(SimSPPF)模块,以增强模型的稳定性并缩短模型的训练时间。为了验证改进模型的有效性,使用了三个具有不同特征的数据集进行实验。实验结果表明,与原始模型相比,我们模型的参数数量显著减少了28%,平均精度均值(mAP)分别提高了3.1%、1.1%和1.8%。与现有的轻量化先进模型相比,该模型在准确性方面也表现更好。在所提出模型在三个具有不同特征的数据集上,mAP分别达到了87.2%、77.8%和92.3%,优于YOLOv7tiny(81.4%、77.7%、90.3%)、YOLOv8n(84.7%、77.7%、90.6%)等先进模型。在实现参数数量减少的情况下,改进后的模型能够成功提高mAP,为在移动或边缘设备上部署该模型提供了重要参考。