• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于少参数量孔卷积神经网络的机器视觉检测快速语义分割方法。

Fast semantic segmentation method for machine vision inspection based on a fewer-parameters atrous convolution neural network.

机构信息

School of Mechanical and Automotive Engineering, South China University of Technology, Guangzhou, Guangdong Province, China.

出版信息

PLoS One. 2021 Feb 10;16(2):e0246093. doi: 10.1371/journal.pone.0246093. eCollection 2021.

DOI:10.1371/journal.pone.0246093
PMID:33566844
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7875430/
Abstract

Owing to the recent development in deep learning, machine vision has been widely used in intelligent manufacturing equipment in multiple fields, including precision-manufacturing production lines and online product-quality inspection. This study aims at online Machine Vision Inspection, focusing on the method of online semantic segmentation under complex backgrounds. First, the fewer-parameters optimization of the atrous convolution architecture is studied. Atrous spatial pyramid pooling (ASPP) and residual network (ResNet) are selected as the basic architectures of ηseg and ηmain, respectively, which indicate that the improved proportion of the participating input image feature is beneficial for improving the accuracy of feature extraction during the change of the number and dimension of feature maps. Second, this study proposes five modified ResNet residual building blocks, with the main path having a 3 × 3 convolution layer, 2 × 2 skip path, and pooling layer with ls = 2, which can improve the use of image features. Finally, the simulation experiments show that our modified structure can significantly decrease segmentation time Tseg from 719 to 296 ms (decreased by 58.8%), with only a slight decrease in the intersection-over-union from 86.7% to 86.6%. The applicability of the proposed machine vision method was verified through the segmentation recognition of the China Yuan (CNY) for the 2019 version. Compared with the conventional method, the proposed model of semantic segmentation visual detection effectively reduces the detection time while ensuring the detection accuracy and has a significant effect of fewer-parameters optimization. This slows for the possibility of neural network detection on mobile terminals.

摘要

由于深度学习的最新发展,机器视觉已广泛应用于多个领域的智能制造设备中,包括精密制造生产线和在线产品质量检查。本研究旨在进行在线机器视觉检查,重点是复杂背景下的在线语义分割方法。首先,研究了空洞卷积结构的更少参数优化。采用空洞空间金字塔池化(ASPP)和残差网络(ResNet)分别作为ηseg和ηmain 的基本架构,这表明改进参与输入图像特征的比例有利于在特征图的数量和维度变化时提高特征提取的准确性。其次,本研究提出了五个改进的 ResNet 残差构建块,主路径具有 3×3 卷积层、2×2 跳过路径和池化层 ls=2,可以提高图像特征的利用率。最后,仿真实验表明,我们的改进结构可以将分割时间 Tseg 从 719 毫秒显著减少到 296 毫秒(减少了 58.8%),而交并比仅从 86.7%略微下降到 86.6%。通过对 2019 年版人民币的分割识别验证了所提出的机器视觉方法的适用性。与传统方法相比,所提出的语义分割视觉检测模型在有效减少检测时间的同时保证了检测精度,具有显著的少参数优化效果。这为神经网络在移动终端上的检测提供了可能性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/ceab8517a6eb/pone.0246093.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/3c0d6ae8f422/pone.0246093.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/0e5e9c17260a/pone.0246093.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/196f5803b9ae/pone.0246093.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/7e158d0a6421/pone.0246093.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/49a4154ef8e3/pone.0246093.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/e3fae64efdfe/pone.0246093.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/7a5d03c4540d/pone.0246093.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/16c3c7aa6197/pone.0246093.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/438637c34302/pone.0246093.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/749c23be9f67/pone.0246093.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/c891ef738846/pone.0246093.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/f99fee214acb/pone.0246093.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/d893a13c77c6/pone.0246093.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/474634b9b1f0/pone.0246093.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/ceab8517a6eb/pone.0246093.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/3c0d6ae8f422/pone.0246093.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/0e5e9c17260a/pone.0246093.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/196f5803b9ae/pone.0246093.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/7e158d0a6421/pone.0246093.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/49a4154ef8e3/pone.0246093.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/e3fae64efdfe/pone.0246093.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/7a5d03c4540d/pone.0246093.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/16c3c7aa6197/pone.0246093.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/438637c34302/pone.0246093.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/749c23be9f67/pone.0246093.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/c891ef738846/pone.0246093.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/f99fee214acb/pone.0246093.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/d893a13c77c6/pone.0246093.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/474634b9b1f0/pone.0246093.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8404/7875430/ceab8517a6eb/pone.0246093.g015.jpg

相似文献

1
Fast semantic segmentation method for machine vision inspection based on a fewer-parameters atrous convolution neural network.基于少参数量孔卷积神经网络的机器视觉检测快速语义分割方法。
PLoS One. 2021 Feb 10;16(2):e0246093. doi: 10.1371/journal.pone.0246093. eCollection 2021.
2
A multiple-channel and atrous convolution network for ultrasound image segmentation.一种用于超声图像分割的多通道多孔卷积网络。
Med Phys. 2020 Dec;47(12):6270-6285. doi: 10.1002/mp.14512. Epub 2020 Oct 18.
3
ACDSSNet: Atrous Convolution-Based Deep Semantic Segmentation Network for Efficient Detection of Sickle Cell Anemia.ACDSSNet:基于空洞卷积的深度语义分割网络,用于高效检测镰状细胞贫血。
IEEE J Biomed Health Inform. 2024 Oct;28(10):5676-5684. doi: 10.1109/JBHI.2024.3362843. Epub 2024 Oct 3.
4
Novel Method of Semantic Segmentation Applicable to Augmented Reality.适用于增强现实的语义分割新方法。
Sensors (Basel). 2020 Mar 20;20(6):1737. doi: 10.3390/s20061737.
5
Waterfall Atrous Spatial Pooling Architecture for Efficient Semantic Segmentation.用于高效语义分割的瀑布空洞空间池化架构。
Sensors (Basel). 2019 Dec 5;19(24):5361. doi: 10.3390/s19245361.
6
Deep Neural Network-Based Semantic Segmentation of Microvascular Decompression Images.基于深度神经网络的微血管减压术图像语义分割
Sensors (Basel). 2021 Feb 7;21(4):1167. doi: 10.3390/s21041167.
7
ADR-Net: Context extraction network based on M-Net for medical image segmentation.ADR-Net:基于M-Net的医学图像分割上下文提取网络。
Med Phys. 2020 Sep;47(9):4254-4264. doi: 10.1002/mp.14364. Epub 2020 Aug 2.
8
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
9
Multi-Scale Deep Neural Network Based on Dilated Convolution for Spacecraft Image Segmentation.基于扩张卷积的多尺度深度神经网络在航天器图像分割中的应用。
Sensors (Basel). 2022 Jun 1;22(11):4222. doi: 10.3390/s22114222.
10
[Segmentation of ground glass pulmonary nodules using full convolution residual network based on atrous spatial pyramid pooling structure and attention mechanism].基于空洞空间金字塔池化结构和注意力机制的全卷积残差网络用于磨玻璃肺结节分割
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2022 Jun 25;39(3):441-451. doi: 10.7507/1001-5515.202010051.

引用本文的文献

1
Predicting geriatric environmental safety perception assessment using LightGBM and SHAP framework.使用LightGBM和SHAP框架预测老年人环境安全感知评估
Sci Rep. 2025 Jul 28;15(1):27444. doi: 10.1038/s41598-025-12541-2.
2
SPNet: Structure preserving network for depth completion.SPNet:用于深度完成的结构保持网络。
PLoS One. 2023 Jan 24;18(1):e0280886. doi: 10.1371/journal.pone.0280886. eCollection 2023.

本文引用的文献

1
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab:基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.
2
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.SegNet:一种用于图像分割的深度卷积编解码器架构。
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615. Epub 2017 Jan 2.
3
Fully Convolutional Networks for Semantic Segmentation.
全卷积网络用于语义分割。
IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):640-651. doi: 10.1109/TPAMI.2016.2572683. Epub 2016 May 24.
4
Robot Guidance Using Machine Vision Techniques in Industrial Environments: A Comparative Review.工业环境中基于机器视觉技术的机器人引导:比较综述
Sensors (Basel). 2016 Mar 5;16(3):335. doi: 10.3390/s16030335.
5
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.