Guangdong ChengTech Tranffic&Technology Development Co., Ltd, Guangzhou, China.
Guangzhou Road Research Institute Co., Ltd, Guangzhou, China.
PLoS One. 2024 Oct 10;19(10):e0309172. doi: 10.1371/journal.pone.0309172. eCollection 2024.
Automatic pavement disease detection aims to address the inefficiency in practical detection. However, traditional methods heavily rely on low-level image analysis, handcrafted features, and classical classifiers, leading to limited effectiveness and poor generalization in complex scenarios. Although significant progress has been made with deep learning methods, challenges persist in handling high-resolution images and diverse disease types. Therefore, this paper proposes a novel approach based on the lightweight Transformer Patch Labeling Network (LTPLN) to enhance the efficiency of automatic pavement disease detection and overcome the limitations of existing methods. Firstly, the input images undergo histogram equalization preprocessing to enhance image quality. Subsequently, the images are evenly partitioned into small patch blocks, serving as inputs to the enhanced Transformer model. This enhancement strategy involves integrating feature map labels at each layer of the model to reduce computational complexity and enhance model lightweightness. Furthermore, a depthwise separable convolution module is introduced into the Transformer architecture to introduce convolutional bias and reduce the model's dependence on large amounts of data. Finally, an iterative training process utilizing the label distillation strategy based on expectation maximization is employed to update the labels of patch blocks and roughly locate the positions of pavement diseases under weak supervision. Experimental results demonstrate that compared to the baseline model, the proposed enhanced model achieves a reduction of 2.5G Flops computational complexity and a 16% speed improvement on a private pavement disease dataset, with only a 1.2 percentage point decrease in AUC accuracy. Moreover, compared to other mainstream image classification models, this model exhibits more balanced performance on a public dataset, with improved accuracy and speed that better align with the practical requirements of pavement inspection. These findings highlight the significant performance advantages of the LTPLN model in automatic pavement disease detection tasks, making it more efficiently applicable in real-world scenarios.
自动路面病害检测旨在解决实际检测中的效率问题。然而,传统方法严重依赖于低级图像分析、手工制作的特征和经典分类器,导致在复杂场景中的效果有限且泛化能力差。尽管深度学习方法取得了显著进展,但在处理高分辨率图像和多种病害类型方面仍存在挑战。因此,本文提出了一种基于轻量级 Transformer 补丁标注网络 (LTPLN) 的新方法,以提高自动路面病害检测的效率,并克服现有方法的局限性。首先,输入图像经过直方图均衡化预处理,以增强图像质量。然后,将图像均匀划分为小补丁块,作为增强型 Transformer 模型的输入。这种增强策略涉及在模型的每个层集成特征图标签,以减少计算复杂度并增强模型的轻量级性。此外,在 Transformer 架构中引入了深度可分离卷积模块,以引入卷积偏差并减少模型对大量数据的依赖。最后,利用基于期望最大化的标签蒸馏策略进行迭代训练过程,更新补丁块的标签,并在弱监督下大致定位路面病害的位置。实验结果表明,与基线模型相比,所提出的增强模型在私有路面病害数据集上实现了 2.5G Flops 计算复杂度的降低和 16%的速度提升,AUC 准确率仅下降了 1.2 个百分点。此外,与其他主流图像分类模型相比,该模型在公共数据集上表现出更均衡的性能,具有更高的准确性和速度,更符合路面检测的实际要求。这些发现突出了 LTPLN 模型在自动路面病害检测任务中的显著性能优势,使其在实际场景中更高效地应用。