Saberironaghi Alireza, Ren Jing
Electrical, Computer and Software Engineering, Ontario Tech University, Oshawa, ON L1G 0C5, Canada.
J Imaging. 2024 Apr 26;10(5):100. doi: 10.3390/jimaging10050100.
Detecting cracks in the pavement is a vital component of ensuring road safety. Since manual identification of these cracks can be time-consuming, an automated method is needed to speed up this process. However, creating such a system is challenging due to factors including crack variability, variations in pavement materials, and the occurrence of miscellaneous objects and anomalies on the pavement. Motivated by the latest progress in deep learning applied to computer vision, we propose an effective U-Net-shaped model named DepthCrackNet. Our model employs the Double Convolution Encoder (DCE), composed of a sequence of convolution layers, for robust feature extraction while keeping parameters optimally efficient. We have incorporated the TriInput Multi-Head Spatial Attention (TMSA) module into our model; in this module, each head operates independently, capturing various spatial relationships and boosting the extraction of rich contextual information. Furthermore, DepthCrackNet employs the Spatial Depth Enhancer (SDE) module, specifically designed to augment the feature extraction capabilities of our segmentation model. The performance of the DepthCrackNet was evaluated on two public crack datasets: Crack500 and DeepCrack. In our experimental studies, the network achieved mIoU scores of 77.0% and 83.9% with the Crack500 and DeepCrack datasets, respectively.
检测路面裂缝是确保道路安全的重要组成部分。由于人工识别这些裂缝可能很耗时,因此需要一种自动化方法来加快这一过程。然而,由于裂缝的多变性、路面材料的差异以及路面上杂物和异常情况的出现等因素,创建这样一个系统具有挑战性。受深度学习在计算机视觉领域最新进展的启发,我们提出了一种有效的名为DepthCrackNet的U型网络模型。我们的模型采用由一系列卷积层组成的双卷积编码器(DCE)进行稳健的特征提取,同时保持参数的最优效率。我们在模型中融入了三输入多头空间注意力(TMSA)模块;在这个模块中,每个头独立运行,捕捉各种空间关系并增强丰富上下文信息的提取。此外,DepthCrackNet采用了空间深度增强器(SDE)模块,专门设计用于增强我们分割模型的特征提取能力。在两个公开的裂缝数据集Crack500和DeepCrack上对DepthCrackNet的性能进行了评估。在我们的实验研究中,该网络在Crack500和DeepCrack数据集上分别取得了77.0%和83.9%的平均交并比分数。