Moon Ji-Hwan, Choi Gyuho, Kim Yu-Hwan, Kim Won-Yeol
Department of Artificial Intelligence Engineering, Chosun University, Gwangju 61452, Republic of Korea.
Department of Computer Engineering, Chosun University, Gwangju 61452, Republic of Korea.
Sensors (Basel). 2024 Feb 24;24(5):1467. doi: 10.3390/s24051467.
Cracks are common defects that occur on the surfaces of objects and structures. Crack detection is a critical maintenance task that traditionally requires manual labor. Large-scale manual inspections are expensive. Research has been conducted to replace expensive human labor with cheaper computing resources. Recently, crack segmentation based on convolutional neural networks (CNNs) and transformers has been actively investigated for local and global information. However, the transformer is data-intensive owing to its weak inductive bias. Existing labeled datasets for crack segmentation are relatively small. Additionally, a limited amount of fine-grained crack data is available. To address this data-intensive problem, we propose a parallel dual encoder network fusing Pre-Conv-based Transformers and convolutional neural networks (PCTC-Net). The Pre-Conv module automatically optimizes each color channel with a small spatial kernel before the input of the transformer. The proposed model, PCTC-Net, was tested with the DeepCrack, Crack500, and Crackseg9k datasets. The experimental results showed that our model achieved higher generalization performance, stability, and F1 scores than the SOTA model DTrC-Net.
裂缝是物体和结构表面常见的缺陷。裂缝检测是一项关键的维护任务,传统上需要人工操作。大规模的人工检查成本高昂。人们进行了相关研究,试图用成本更低的计算资源取代昂贵的人力。最近,基于卷积神经网络(CNN)和变换器的裂缝分割技术因能处理局部和全局信息而受到积极研究。然而,变换器由于其较弱的归纳偏置而数据密集。现有的用于裂缝分割的标注数据集相对较小。此外,细粒度裂缝数据的数量有限。为了解决这个数据密集型问题,我们提出了一种融合基于预卷积的变换器和卷积神经网络的并行双编码器网络(PCTC-Net)。预卷积模块在变换器输入之前,使用小空间内核自动优化每个颜色通道。所提出的模型PCTC-Net在DeepCrack、Crack500和Crackseg9k数据集上进行了测试。实验结果表明,我们的模型比当前最优模型DTrC-Net具有更高的泛化性能、稳定性和F1分数。