Suppr超能文献

PCTC-Net:一种具有并行双编码器网络的裂缝分割网络,该网络融合了基于预卷积的Transformer和卷积神经网络。

PCTC-Net: A Crack Segmentation Network with Parallel Dual Encoder Network Fusing Pre-Conv-Based Transformers and Convolutional Neural Networks.

作者信息

Moon Ji-Hwan, Choi Gyuho, Kim Yu-Hwan, Kim Won-Yeol

机构信息

Department of Artificial Intelligence Engineering, Chosun University, Gwangju 61452, Republic of Korea.

Department of Computer Engineering, Chosun University, Gwangju 61452, Republic of Korea.

出版信息

Sensors (Basel). 2024 Feb 24;24(5):1467. doi: 10.3390/s24051467.

Abstract

Cracks are common defects that occur on the surfaces of objects and structures. Crack detection is a critical maintenance task that traditionally requires manual labor. Large-scale manual inspections are expensive. Research has been conducted to replace expensive human labor with cheaper computing resources. Recently, crack segmentation based on convolutional neural networks (CNNs) and transformers has been actively investigated for local and global information. However, the transformer is data-intensive owing to its weak inductive bias. Existing labeled datasets for crack segmentation are relatively small. Additionally, a limited amount of fine-grained crack data is available. To address this data-intensive problem, we propose a parallel dual encoder network fusing Pre-Conv-based Transformers and convolutional neural networks (PCTC-Net). The Pre-Conv module automatically optimizes each color channel with a small spatial kernel before the input of the transformer. The proposed model, PCTC-Net, was tested with the DeepCrack, Crack500, and Crackseg9k datasets. The experimental results showed that our model achieved higher generalization performance, stability, and F1 scores than the SOTA model DTrC-Net.

摘要

裂缝是物体和结构表面常见的缺陷。裂缝检测是一项关键的维护任务,传统上需要人工操作。大规模的人工检查成本高昂。人们进行了相关研究,试图用成本更低的计算资源取代昂贵的人力。最近,基于卷积神经网络(CNN)和变换器的裂缝分割技术因能处理局部和全局信息而受到积极研究。然而,变换器由于其较弱的归纳偏置而数据密集。现有的用于裂缝分割的标注数据集相对较小。此外,细粒度裂缝数据的数量有限。为了解决这个数据密集型问题,我们提出了一种融合基于预卷积的变换器和卷积神经网络的并行双编码器网络(PCTC-Net)。预卷积模块在变换器输入之前,使用小空间内核自动优化每个颜色通道。所提出的模型PCTC-Net在DeepCrack、Crack500和Crackseg9k数据集上进行了测试。实验结果表明,我们的模型比当前最优模型DTrC-Net具有更高的泛化性能、稳定性和F1分数。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c0/10934755/e72ba540a40d/sensors-24-01467-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验