一种通过融合目标与上下文信息进行铁路线路缺陷检测的混合系统。

A Hybrid System for Defect Detection on Rail Lines through the Fusion of Object and Context Information.

作者信息

Zhukov Alexey, Rivero Alain, Benois-Pineau Jenny, Zemmari Akka, Mosbah Mohamed

机构信息

Univ. Bordeaux, CNRS, Bordeaux INP, INRIA, LaBRI, UMR 5800, 33400 Talence, France.

Ferrocampus, 18 bd Guillet-Maillet, 17100 Saintes, France.

出版信息

Sensors (Basel). 2024 Feb 10;24(4):1171. doi: 10.3390/s24041171.

DOI:10.3390/s24041171

PMID:38400331

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10892099/

Abstract

Defect detection on rail lines is essential for ensuring safe and efficient transportation. Current image analysis methods with deep neural networks (DNNs) for defect detection often focus on the defects themselves while ignoring the related context. In this work, we propose a fusion model that combines both a targeted defect search and a context analysis, which is seen as a multimodal fusion task. Our model performs rule-based decision-level fusion, merging the confidence scores of multiple individual models to classify rail-line defects. We call the model "hybrid" in the sense that it is composed of supervised learning components and rule-based fusion. We first propose an improvement to existing vision-based defect detection methods by incorporating a convolutional block attention module (CBAM) in the you only look once (YOLO) versions 5 (YOLOv5) and 8 (YOLOv8) architectures for the detection of defects and contextual image elements. This attention module is applied at different detection scales. The domain-knowledge rules are applied to fuse the detection results. Our method demonstrates improvements over baseline models in vision-based defect detection. The model is open for the integration of modalities other than an image, e.g., sound and accelerometer data.

摘要

铁路线路上的缺陷检测对于确保安全高效的运输至关重要。当前使用深度神经网络（DNN）进行缺陷检测的图像分析方法通常只关注缺陷本身，而忽略了相关的上下文信息。在这项工作中，我们提出了一种融合模型，该模型结合了有针对性的缺陷搜索和上下文分析，这被视为一个多模态融合任务。我们的模型执行基于规则的决策级融合，合并多个单独模型的置信度分数以对铁路线路缺陷进行分类。我们称该模型为“混合”模型，因为它由监督学习组件和基于规则的融合组成。我们首先通过在你只看一次（YOLO）版本5（YOLOv5）和8（YOLOv8）架构中纳入卷积块注意力模块（CBAM）来改进现有的基于视觉的缺陷检测方法，以检测缺陷和上下文图像元素。这个注意力模块应用于不同的检测尺度。应用领域知识规则来融合检测结果。我们的方法在基于视觉的缺陷检测中比基线模型有改进。该模型开放用于集成除图像之外的其他模态，例如声音和加速度计数据。