Hu Yuanyuan, Chen Ning, Zhang Hancheng, Hou Yue, Liu Pengfei
Institute of Highway Engineering, RWTH Aachen University, Aachen, Germany.
Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing, China.
Research (Wash D C). 2025 Aug 25;8:0833. doi: 10.34133/research.0833. eCollection 2025.
During the designed service life, road infrastructures will bear repeated loading conditions from vehicle weights and environmental conditions, resulting in the inevitable occurrence of road distresses including cracks, potholes, etc. The traditional inspection methods by transportation engineers are normally costly and labor-intensive. In recent years, artificial intelligence (AI)-based road distress detection methods have been widely used as convenient and automated approaches, while the AI-based methods heavily depend on a large amount of high-quality images, limiting the real engineering applications. To address the issues, this study introduces RoadDiffBox, a novel framework employing controlled image generation and semi-supervised learning. The framework addresses dataset imbalances through class control and accelerates image generation by utilizing the denoising diffusion implicit model's reverse process sampling method, while employing knowledge distillation techniques optimized for resource-constrained mobile devices. It generates diverse and high-quality road distress images with automatic bounding box annotations, substantially reducing manual labeling requirements. Test results show that RoadDiffBox demonstrates strong generalizability across geographic regions (Germany, China, and India) and shows cross-domain potential in medical imaging applications. Performance evaluations demonstrate RoadDiffBox's effectiveness, with classification models achieving an F1-score of 0.95 and detection models reaching a mean average precision (mAP@50) of 0.95 and an F1-score of 0.91 in controlled settings, while maintaining robust performance (an F1-score of 0.86 and a mAP@50 of 0.91) during on-site testing in real-world conditions. On server-class hardware, the model achieves generation times as low as 0.18 s per image. It is discovered that RoadDiffBox can serve as a scalable and efficient solution for real-time road maintenance with limited datasets.
在设计使用寿命期间,道路基础设施将承受来自车辆重量和环境条件的反复荷载作用,导致不可避免地出现包括裂缝、坑洼等在内的道路病害。交通工程师采用的传统检测方法通常成本高昂且 labor-intensive。近年来,基于人工智能(AI)的道路病害检测方法作为方便且自动化的方法被广泛使用,然而基于AI的方法严重依赖大量高质量图像,限制了实际工程应用。为了解决这些问题,本研究引入了RoadDiffBox,这是一个采用可控图像生成和半监督学习的新颖框架。该框架通过类别控制解决数据集不平衡问题,并利用去噪扩散隐模型的反向过程采样方法加速图像生成,同时采用针对资源受限移动设备优化的知识蒸馏技术。它生成具有自动边界框注释的多样且高质量的道路病害图像,大幅减少人工标注需求。测试结果表明,RoadDiffBox在不同地理区域(德国、中国和印度)具有很强的通用性,并在医学成像应用中显示出跨领域潜力。性能评估证明了RoadDiffBox的有效性,在受控设置下,分类模型的F1分数达到0.95,检测模型的平均精度均值(mAP@50)达到0.95,F1分数达到0.91,而在实际现场测试中保持稳健性能(F1分数为0.86,mAP@50为0.91)。在服务器级硬件上,该模型的图像生成时间低至每秒0.18秒。研究发现,RoadDiffBox可以作为一种可扩展且高效的解决方案,用于在数据集有限的情况下进行实时道路维护。