Li Hanlin, Liu Ming, Yin Yanfang, Sun Weiliang
Shandong University of Science and Technology, College of Electrical Engineering and Automation, Qingdao, 266590, China.
Sci Rep. 2025 Mar 26;15(1):10371. doi: 10.1038/s41598-024-74601-3.
Detecting defects on steel surfaces is crucial for ensuring product quality and production safety in industrial settings. Object detection using deep learning, particularly the YOLOv5 model, has become a widely adopted method for this purpose. However, the complex shapes of current steel surface defects pose challenges for precise detection, especially when using low-cost recognition devices with small resolution images. To address these challenges, we integrated the RepBi-PAN fusion network into YOLOv5, enhancing the detection capability for large targets in complex backgrounds. To mitigate issues related to the premature introduction of shallow features and decrease in Precision, we optimized the model structure by incorporating the DenseNet structure into the backbone for improved feature extraction. Additionally, we introduced the Normalized Attention Module (NAM) to enhance the detection capability for small targets. Experimental results demonstrate the effectiveness of the enhanced model, showing a 4.1% increase in mean average precision (mAP), a 3.2% improvement in precision, and a 2.4% enhancement in recall. The improved algorithm outperforms the original in complex backgrounds and recognizing small targets, addressing limitations of the Rep-Bi network. Compared to other YOLO algorithms, our approach achieves optimal values for recall and mAP while maintaining a smaller model size. When compared to YOLOv9, which is the best-performing algorithm in the YOLO series on the dataset used in this study, our model demonstrates several advantages. Specifically, it maintains superior overall performance with fewer parameters and lower computational requirements compared to deeper YOLOv9 variants. Furthermore, when compared to YOLOv9s, our model exhibits better performance in terms of precision, recall, and mAP, while also having fewer GFLOPs, a smaller parameter count, and a reduced model size.
检测钢表面的缺陷对于确保工业环境中的产品质量和生产安全至关重要。使用深度学习进行目标检测,特别是YOLOv5模型,已成为实现此目的广泛采用的方法。然而,当前钢表面缺陷的复杂形状对精确检测构成挑战,尤其是在使用具有小分辨率图像的低成本识别设备时。为应对这些挑战,我们将RepBi-PAN融合网络集成到YOLOv5中,增强了在复杂背景下对大目标的检测能力。为缓解与浅层特征过早引入和精度下降相关的问题,我们通过将DenseNet结构纳入主干来优化模型结构,以改进特征提取。此外,我们引入了归一化注意力模块(NAM)来增强对小目标的检测能力。实验结果证明了增强模型的有效性,平均精度均值(mAP)提高了4.1%,精度提高了3.2%,召回率提高了2.4%。改进后的算法在复杂背景和识别小目标方面优于原始算法,解决了Rep-Bi网络的局限性。与其他YOLO算法相比,我们的方法在召回率和mAP方面达到了最优值,同时保持了较小的模型大小。与YOLOv9相比,YOLOv9是本研究中使用的数据集上YOLO系列中性能最佳的算法,我们的模型具有几个优势。具体而言,与更深的YOLOv9变体相比,它以更少的参数和更低的计算需求保持了卓越的整体性能。此外,与YOLOv9s相比,我们的模型在精度、召回率和mAP方面表现更好,同时还具有更少的GFLOP、更少的参数数量和更小的模型大小。