Fan Yawen, Hu Zhengkai, Li Qinxin, Sun Yang, Chen Jianxin, Zhou Quan
National Engineering Research Center of Communications and Networking, Nanjing University of Posts & Telecommunications, Nanjing 210003, China.
Artificial Intelligence of Key Laboratory of Sichuan Province, Sichuan University of Science and Engineering, Yibin 644000, China.
Sensors (Basel). 2024 Nov 6;24(22):7134. doi: 10.3390/s24227134.
Cracks are a common form of damage in infrastructure, posing significant risks to both personal safety and property. Along with the development of deep learning, visual-based crack automatic detection has been widely studied. However, this task is still challenging due to complex crack topology, noisy backgrounds, unbalanced categories, etc. To address these challenges, this research proposes a novel hybrid network, named CrackNet, which leverages the strengths of both CNN and transformer. On the encoder side, CNNs are employed to extract multi-level local features, while transformers are used to model global dependencies. Additionally, a strip pooling module is introduced to suppress irrelevant regions and enhance the network's ability to segment narrow and elongated cracks. On the decoder side, an attention-based skip connection strategy and a mixed up-sampling module are implemented to restore detailed information. Furthermore, a joint learning loss combining Dice and cross-entropy with dynamic weighting is proposed to mitigate the effects of severe class imbalance. CrackNet is trained and evaluated on three public crack datasets, and experimental results show that the proposed model outperforms several well-known deep neural networks, with a particularly noticeable improvement in recall rate.
裂缝是基础设施中常见的损伤形式,对人身安全和财产都构成重大风险。随着深度学习的发展,基于视觉的裂缝自动检测受到了广泛研究。然而,由于裂缝拓扑复杂、背景噪声、类别不平衡等原因,这项任务仍然具有挑战性。为了应对这些挑战,本研究提出了一种新型混合网络,名为CrackNet,它利用了卷积神经网络(CNN)和Transformer的优势。在编码器端,使用CNN提取多级局部特征,而Transformer用于建模全局依赖性。此外,引入了带状池化模块来抑制无关区域,并增强网络分割狭窄和细长裂缝的能力。在解码器端,实现了基于注意力的跳跃连接策略和混合上采样模块来恢复详细信息。此外,还提出了一种结合Dice和交叉熵并带有动态加权的联合学习损失,以减轻严重类别不平衡的影响。在三个公共裂缝数据集上对CrackNet进行了训练和评估,实验结果表明,所提出的模型优于几个著名的深度神经网络,召回率有特别显著的提高。