• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过基于PConv的微调与自动提示器利用视觉基础模型进行缺陷分割。

Leveraging Vision Foundation Model via PConv-Based Fine-Tuning with Automated Prompter for Defect Segmentation.

作者信息

Jiang Yifan, Chen Jinshui, Lu Jiangang

机构信息

State Key Laboratory of Industrial Control Technology, College of Control Science and Engineering, Zhejiang University, Hangzhou 310027, China.

出版信息

Sensors (Basel). 2025 Apr 11;25(8):2417. doi: 10.3390/s25082417.

DOI:10.3390/s25082417
PMID:40285107
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12031572/
Abstract

In industrial scenarios, image segmentation is essential for accurately identifying defect regions. Recently, the emergence of foundation models driven by powerful computational resources and large-scale training data has brought about a paradigm shift in deep learning-based image segmentation. The Segment Anything Model (SAM) has shown exceptional performance across various downstream tasks, owing to its vast semantic knowledge and strong generalization capabilities. However, the feature distribution discrepancy, reliance on manually labeled prompts, and limited category information of SAM reduce its scalability in industrial settings. To address these issues, we propose PA-SAM, an industrial defect segmentation framework based on SAM. Firstly, to bridge the gap between SAM's pre-training data and distinct characteristics of industrial defects, we introduce a parameter-efficient fine-tuning (PEFT) technique incorporating lightweight Multi-Scale Partial Convolution Aggregation (MSPCA) into Low-Rank Adaptation (LoRA), named MSPCA-LoRA, which effectively enhances the image encoder's sensitivity to prior knowledge biases, while maintaining PEFT efficiency. Furthermore, we present the Image-to-Prompt Embedding Generator (IPEG), which utilizes image embeddings to autonomously create high-quality prompt embeddings for directing mask segmentation, eliminating the limitations of manually provided prompts. Finally, we apply effective refinements to SAM's mask decoder, transforming SAM into an end-to-end semantic segmentation framework. On two real-world defect segmentation datasets, PA-SAM achieves mean Intersections over Union of 73.87% and 68.30%, as well as mean Dice coefficients of 84.90% and 80.22%, outperforming other state-of-the-art algorithms, further demonstrating its robust generalization and application potential.

摘要

在工业场景中,图像分割对于准确识别缺陷区域至关重要。近年来,由强大计算资源和大规模训练数据驱动的基础模型的出现,给基于深度学习的图像分割带来了范式转变。分割一切模型(SAM)凭借其丰富的语义知识和强大的泛化能力,在各种下游任务中展现出卓越性能。然而,SAM的特征分布差异、对人工标注提示的依赖以及有限的类别信息,降低了其在工业环境中的可扩展性。为了解决这些问题,我们提出了PA-SAM,一种基于SAM的工业缺陷分割框架。首先,为了弥合SAM预训练数据与工业缺陷独特特征之间的差距,我们引入了一种参数高效微调(PEFT)技术,将轻量级多尺度局部卷积聚合(MSPCA)融入低秩适应(LoRA),命名为MSPCA-LoRA,它有效增强了图像编码器对先验知识偏差的敏感性,同时保持了PEFT效率。此外,我们提出了图像到提示嵌入生成器(IPEG),它利用图像嵌入自动创建高质量的提示嵌入来指导掩码分割,消除了人工提供提示的局限性。最后,我们对SAM的掩码解码器进行有效改进,将SAM转变为一个端到端的语义分割框架。在两个真实世界的缺陷分割数据集上,PA-SAM的平均交并比分别达到73.87%和68.30%,平均Dice系数分别达到84.90%和80.22%,优于其他先进算法,进一步证明了其强大的泛化能力和应用潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/735c49ee0ac5/sensors-25-02417-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/30e11ab9e6a9/sensors-25-02417-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/fcd92f7bc302/sensors-25-02417-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/4ef3068a7189/sensors-25-02417-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/2c0903ce53a7/sensors-25-02417-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/ac637c87359c/sensors-25-02417-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/acdca6b5ab5a/sensors-25-02417-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/df7c31dab182/sensors-25-02417-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/c31257334f20/sensors-25-02417-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/77ee1ca0d651/sensors-25-02417-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/735c49ee0ac5/sensors-25-02417-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/30e11ab9e6a9/sensors-25-02417-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/fcd92f7bc302/sensors-25-02417-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/4ef3068a7189/sensors-25-02417-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/2c0903ce53a7/sensors-25-02417-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/ac637c87359c/sensors-25-02417-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/acdca6b5ab5a/sensors-25-02417-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/df7c31dab182/sensors-25-02417-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/c31257334f20/sensors-25-02417-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/77ee1ca0d651/sensors-25-02417-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/03b5/12031572/735c49ee0ac5/sensors-25-02417-g010.jpg

相似文献

1
Leveraging Vision Foundation Model via PConv-Based Fine-Tuning with Automated Prompter for Defect Segmentation.通过基于PConv的微调与自动提示器利用视觉基础模型进行缺陷分割。
Sensors (Basel). 2025 Apr 11;25(8):2417. doi: 10.3390/s25082417.
2
Sam2Rad: A segmentation model for medical images with learnable prompts.Sam2Rad:一种具有可学习提示的医学图像分割模型。
Comput Biol Med. 2025 Mar;187:109725. doi: 10.1016/j.compbiomed.2025.109725. Epub 2025 Feb 5.
3
MA-SAM: Modality-agnostic SAM adaptation for 3D medical image segmentation.MA-SAM:用于 3D 医学图像分割的模态无关 SAM 适配。
Med Image Anal. 2024 Dec;98:103310. doi: 10.1016/j.media.2024.103310. Epub 2024 Aug 22.
4
ProtoSAM-3D: Interactive semantic segmentation in volumetric medical imaging via a Segment Anything Model and mask-level prototypes.ProtoSAM-3D:通过分割一切模型和掩码级原型在体积医学成像中进行交互式语义分割。
Comput Med Imaging Graph. 2025 Apr;121:102501. doi: 10.1016/j.compmedimag.2025.102501. Epub 2025 Feb 1.
5
Segment anything model for medical image analysis: An experimental study.用于医学图像分析的分割模型:一项实验研究。
Med Image Anal. 2023 Oct;89:102918. doi: 10.1016/j.media.2023.102918. Epub 2023 Aug 2.
6
SPA: Leveraging the SAM with Spatial Priors Adapter for Enhanced Medical Image Segmentation.SPA:利用带有空间先验适配器的SAM进行增强型医学图像分割
IEEE J Biomed Health Inform. 2025 Jan 6;PP. doi: 10.1109/JBHI.2025.3526174.
7
Learnable Prompting SAM-Induced Knowledge Distillation for Semi-Supervised Medical Image Segmentation.用于半监督医学图像分割的可学习提示SAM诱导知识蒸馏
IEEE Trans Med Imaging. 2025 May;44(5):2295-2306. doi: 10.1109/TMI.2025.3530097. Epub 2025 May 2.
8
Enhancing Medical Imaging Segmentation with GB-SAM: A Novel Approach to Tissue Segmentation Using Granular Box Prompts.利用GB-SAM增强医学影像分割:一种使用粒度框提示进行组织分割的新方法。
Cancers (Basel). 2024 Jun 28;16(13):2391. doi: 10.3390/cancers16132391.
9
A segment anything model-guided and match-based semi-supervised segmentation framework for medical imaging.一种用于医学成像的基于段式分割模型引导和匹配的半监督分割框架。
Med Phys. 2025 Mar 29. doi: 10.1002/mp.17785.
10
[Not Available].[无可用内容]
Med Phys. 2024 Mar;51(3):2187-2199. doi: 10.1002/mp.16965. Epub 2024 Feb 6.

引用本文的文献

1
VIPS: Learning-View-Invariant Feature for Person Search.VIPS:用于行人搜索的学习视图不变特征
Sensors (Basel). 2025 Aug 29;25(17):5362. doi: 10.3390/s25175362.

本文引用的文献

1
Sam2Rad: A segmentation model for medical images with learnable prompts.Sam2Rad:一种具有可学习提示的医学图像分割模型。
Comput Biol Med. 2025 Mar;187:109725. doi: 10.1016/j.compbiomed.2025.109725. Epub 2025 Feb 5.
2
Segmentation of void defects in X-ray images of chip solder joints based on PCB-DeepLabV3 algorithm.基于PCB-DeepLabV3算法的芯片焊点X射线图像中空洞缺陷分割
Sci Rep. 2024 May 24;14(1):11925. doi: 10.1038/s41598-024-61643-w.
3
Segment anything in medical images.在医学图像中分割任何内容。
Nat Commun. 2024 Jan 22;15(1):654. doi: 10.1038/s41467-024-44824-z.
4
Segment anything model for medical images?用于医学图像的图像分割模型?
Med Image Anal. 2024 Feb;92:103061. doi: 10.1016/j.media.2023.103061. Epub 2023 Dec 7.
5
Enhancing Agricultural Image Segmentation with an Agricultural Segment Anything Model Adapter.使用农业分割一切模型适配器增强农业图像分割
Sensors (Basel). 2023 Sep 14;23(18):7884. doi: 10.3390/s23187884.
6
A review on modern defect detection models using DCNNs - Deep convolutional neural networks.基于 DCNN 的现代缺陷检测模型综述 - 深度卷积神经网络。
J Adv Res. 2021 Apr 23;35:33-48. doi: 10.1016/j.jare.2021.03.015. eCollection 2022 Jan.
7
Image Segmentation Using Deep Learning: A Survey.基于深度学习的图像分割技术综述。
IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3523-3542. doi: 10.1109/TPAMI.2021.3059968. Epub 2022 Jun 3.
8
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.
9
Fully Convolutional Networks for Semantic Segmentation.全卷积网络用于语义分割。
IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):640-651. doi: 10.1109/TPAMI.2016.2572683. Epub 2016 May 24.