• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于双骨干网络的钢材表面缺陷检测:MBDNet-Attention-YOLO

Detection of Surface Defects in Steel Based on Dual-Backbone Network: MBDNet-Attention-YOLO.

作者信息

Wang Xinyu, Ma Shuhui, Wu Shiting, Li Zhaoye, Cao Jinrong, Xu Peiquan

机构信息

School of Materials Science and Engineering, Shanghai University of Engineering Science, Shanghai 201620, China.

School of Arts and Sciences, Northeast Agricultural University, Harbin 150030, China.

出版信息

Sensors (Basel). 2025 Aug 5;25(15):4817. doi: 10.3390/s25154817.

DOI:10.3390/s25154817
PMID:40807981
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12349437/
Abstract

Automated surface defect detection in steel manufacturing is pivotal for ensuring product quality, yet it remains an open challenge owing to the extreme heterogeneity of defect morphologies-ranging from hairline cracks and microscopic pores to elongated scratches and shallow dents. Existing approaches, whether classical vision pipelines or recent deep-learning paradigms, struggle to simultaneously satisfy the stringent demands of industrial scenarios: high accuracy on sub-millimeter flaws, insensitivity to texture-rich backgrounds, and real-time throughput on resource-constrained hardware. Although contemporary detectors have narrowed the gap, they still exhibit pronounced sensitivity-robustness trade-offs, particularly in the presence of scale-varying defects and cluttered surfaces. To address these limitations, we introduce MBY (MBDNet-Attention-YOLO), a lightweight yet powerful framework that synergistically couples the MBDNet backbone with the YOLO detection head. Specifically, the backbone embeds three novel components: (1) HGStem, a hierarchical stem block that enriches low-level representations while suppressing redundant activations; (2) Dynamic Align Fusion (DAF), an adaptive cross-scale fusion mechanism that dynamically re-weights feature contributions according to defect saliency; and (3) C2f-DWR, a depth-wise residual variant that progressively expands receptive fields without incurring prohibitive computational costs. Building upon this enriched feature hierarchy, the neck employs our proposed MultiSEAM module-a cascaded squeeze-and-excitation attention mechanism operating at multiple granularities-to harmonize fine-grained and semantic cues, thereby amplifying weak defect signals against complex textures. Finally, we integrate the Inner-SIoU loss, which refines the geometric alignment between predicted and ground-truth boxes by jointly optimizing center distance, aspect ratio consistency, and IoU overlap, leading to faster convergence and tighter localization. Extensive experiments on two publicly available steel-defect benchmarks-NEU-DET and PVEL-AD-demonstrate the superiority of MBY. Without bells and whistles, our model achieves 85.8% mAP@0.5 on NEU-DET and 75.9% mAP@0.5 on PVEL-AD, surpassing the best-reported results by significant margins while maintaining real-time inference on an NVIDIA Jetson Xavier. Ablation studies corroborate the complementary roles of each component, underscoring MBY's robustness across defect scales and surface conditions. These results suggest that MBY strikes an appealing balance between accuracy, efficiency, and deployability, offering a pragmatic solution for next-generation industrial quality-control systems.

摘要

钢铁制造中的自动化表面缺陷检测对于确保产品质量至关重要,但由于缺陷形态的极端异质性(从发丝裂缝、微观孔隙到细长划痕和浅凹痕),这仍然是一个悬而未决的挑战。现有的方法,无论是传统的视觉管道还是最近的深度学习范式,都难以同时满足工业场景的严格要求:对亚毫米级缺陷的高精度、对纹理丰富背景的不敏感以及在资源受限硬件上的实时吞吐量。尽管当代探测器已经缩小了差距,但它们仍然表现出明显的灵敏度 - 鲁棒性权衡,特别是在存在尺度变化的缺陷和杂乱表面的情况下。为了解决这些限制,我们引入了MBY(MBDNet - 注意力 - YOLO),这是一个轻量级但强大的框架,它将MBDNet骨干与YOLO检测头协同耦合。具体来说,骨干嵌入了三个新颖的组件:(1)HGStem,一种分层主干块,在抑制冗余激活的同时丰富低级表示;(2)动态对齐融合(DAF),一种自适应跨尺度融合机制,根据缺陷显著性动态重新加权特征贡献;(3)C2f - DWR,一种深度可分离残差变体,在不产生过高计算成本的情况下逐步扩大感受野。基于这种丰富的特征层次结构,颈部采用了我们提出的MultiSEAM模块——一种在多个粒度上运行的级联挤压和激励注意力机制——来协调细粒度和语义线索,从而在复杂纹理中放大微弱的缺陷信号。最后,我们集成了Inner - SIoU损失,通过联合优化中心距离、纵横比一致性和IoU重叠来细化预测框和真实框之间的几何对齐,从而实现更快的收敛和更紧密的定位。在两个公开可用的钢铁缺陷基准数据集——NEU - DET和PVEL - AD上进行的广泛实验证明了MBY的优越性。在没有花里胡哨功能的情况下,我们 的模型在NEU - DET上达到了85.8% mAP@0.5,在PVEL - AD上达到了75.9% mAP@0.5,大幅超越了之前报道的最佳结果,同时在NVIDIA Jetson Xavier上保持实时推理。消融研究证实了每个组件的互补作用,强调了MBY在不同缺陷尺度和表面条件下的鲁棒性。这些结果表明,MBY在准确性、效率和可部署性之间取得了诱人的平衡,为下一代工业质量控制系统提供了一个实用的解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/ad8145069b71/sensors-25-04817-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/7fb9caff4144/sensors-25-04817-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/5238d0a038b7/sensors-25-04817-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/64356b7c60ce/sensors-25-04817-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/49c66050bed2/sensors-25-04817-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/ce8ad17af068/sensors-25-04817-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/ea9c723aeccb/sensors-25-04817-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/3c35d74d6de4/sensors-25-04817-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/b8203ed3454e/sensors-25-04817-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/345a109eae3b/sensors-25-04817-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/a77f41f68980/sensors-25-04817-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/75b3e1254d1c/sensors-25-04817-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/4607bd9b4bad/sensors-25-04817-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/ad8145069b71/sensors-25-04817-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/7fb9caff4144/sensors-25-04817-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/5238d0a038b7/sensors-25-04817-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/64356b7c60ce/sensors-25-04817-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/49c66050bed2/sensors-25-04817-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/ce8ad17af068/sensors-25-04817-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/ea9c723aeccb/sensors-25-04817-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/3c35d74d6de4/sensors-25-04817-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/b8203ed3454e/sensors-25-04817-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/345a109eae3b/sensors-25-04817-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/a77f41f68980/sensors-25-04817-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/75b3e1254d1c/sensors-25-04817-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/4607bd9b4bad/sensors-25-04817-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c3e/12349437/ad8145069b71/sensors-25-04817-g013.jpg

相似文献

1
Detection of Surface Defects in Steel Based on Dual-Backbone Network: MBDNet-Attention-YOLO.基于双骨干网络的钢材表面缺陷检测:MBDNet-Attention-YOLO
Sensors (Basel). 2025 Aug 5;25(15):4817. doi: 10.3390/s25154817.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
DSAT: a dynamic sparse attention transformer for steel surface defect detection with hierarchical feature fusion.DSAT:一种用于钢表面缺陷检测的动态稀疏注意力变换器,具有分层特征融合。
Sci Rep. 2025 Aug 9;15(1):29198. doi: 10.1038/s41598-025-14935-8.
4
Multi-level channel-spatial attention and light-weight scale-fusion network (MCSLF-Net): multi-level channel-spatial attention and light-weight scale-fusion transformer for 3D brain tumor segmentation.多级通道空间注意力与轻量级尺度融合网络(MCSLF-Net):用于3D脑肿瘤分割的多级通道空间注意力与轻量级尺度融合变换器
Quant Imaging Med Surg. 2025 Jul 1;15(7):6301-6325. doi: 10.21037/qims-2025-354. Epub 2025 Jun 30.
5
Fast Anomaly Detection for Vision-Based Industrial Inspection Using Cascades of Null Subspace PCA Detectors.使用零子空间主成分分析检测器级联的基于视觉的工业检测快速异常检测
Sensors (Basel). 2025 Aug 7;25(15):4853. doi: 10.3390/s25154853.
6
Short-Term Memory Impairment短期记忆障碍
7
PEYOLO a perception efficient network for multiscale surface defects detection.PEYOLO:一种用于多尺度表面缺陷检测的感知高效网络。
Sci Rep. 2025 Aug 6;15(1):28804. doi: 10.1038/s41598-025-05574-0.
8
YOLO-RDM: A high accuracy and efficient algorithm for magnetic tile surface defect detection with practical applications.YOLO-RDM:一种用于磁瓦表面缺陷检测的高精度高效算法及其实际应用
PLoS One. 2025 Jul 18;20(7):e0328815. doi: 10.1371/journal.pone.0328815. eCollection 2025.
9
LEAD-YOLO: A Lightweight and Accurate Network for Small Object Detection in Autonomous Driving.LEAD-YOLO:一种用于自动驾驶中小目标检测的轻量级且精确的网络。
Sensors (Basel). 2025 Aug 4;25(15):4800. doi: 10.3390/s25154800.
10
AC-YOLO: A lightweight ship detection model for SAR images based on YOLO11.AC - YOLO:一种基于YOLO11的用于合成孔径雷达(SAR)图像的轻量级船舶检测模型。
PLoS One. 2025 Jul 30;20(7):e0327362. doi: 10.1371/journal.pone.0327362. eCollection 2025.

本文引用的文献

1
Align While Fusion: A Generalized Nonaligned Multiview Multilabel Classification Method.融合时对齐:一种广义的非对齐多视图多标签分类方法。
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):7627-7636. doi: 10.1109/TNNLS.2024.3387577. Epub 2025 Apr 4.
2
Visual Sensing and Depth Perception for Welding Robots and Their Industrial Applications.焊接机器人的视觉传感与深度感知及其工业应用
Sensors (Basel). 2023 Dec 8;23(24):9700. doi: 10.3390/s23249700.
3
MSFT-YOLO: Improved YOLOv5 Based on Transformer for Detecting Defects of Steel Surface.
微软-你只看一次(MSFT-YOLO):基于Transformer改进的YOLOv5用于检测钢表面缺陷
Sensors (Basel). 2022 May 2;22(9):3467. doi: 10.3390/s22093467.
4
Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance Segmentation.增强目标检测与实例分割模型学习与推理中的几何因素
IEEE Trans Cybern. 2022 Aug;52(8):8574-8586. doi: 10.1109/TCYB.2021.3095305. Epub 2022 Jul 19.