Railway Infrastructure Inspection Institute, China Academy of Railway Science, Beijing, China.
PLoS One. 2022 May 17;17(5):e0268518. doi: 10.1371/journal.pone.0268518. eCollection 2022.
The detection of rail surface defects is vital for high-speed rail maintenance and management. The CNN-based computer vision approach has been proved to be a strong detection tool widely used in various industrial scenarios. However, the CNN-based detection models are diverse from each other in performance, and most of them require sufficient training samples to achieve high detection performance. Selecting an appropriate model and tuning it with insufficient annotated rail defect images is time-consuming and tedious. To overcome this challenge, motivated by ensemble learning that uses multiple learning algorithms to obtain better predictive performance, we develop an ensemble framework for industrialized rail defect detection. We apply multiple backbone networks individually to obtain features, and mix them in a binary format to obtain better and more diverse sub-networks. Image augmentation and feature augmentation operations are randomly applied to further make the model more diverse. A shared feature pyramid network is adopted to reduce model parameters as well as computation cost. Experimental results substantiate that the approach outperforms single detecting architecture in our specified rail defect task. On the collected dataset with 8 defect classes, our algorithm achieves 7.4% higher mAP.5 compared with YOLOv5 and 2.8% higher mAP.5 compared with Faster R-CNN.
铁轨表面缺陷的检测对高速铁路的维护和管理至关重要。基于卷积神经网络(CNN)的计算机视觉方法已被证明是一种强大的检测工具,广泛应用于各种工业场景。然而,基于 CNN 的检测模型在性能上存在差异,并且大多数模型都需要足够的训练样本才能实现高检测性能。选择合适的模型并使用不足的标注铁轨缺陷图像进行调整既耗时又乏味。为了克服这一挑战,受集成学习的启发,该学习方法使用多个学习算法来获得更好的预测性能,我们为工业化铁轨缺陷检测开发了一个集成框架。我们分别应用多个骨干网络来获取特征,并以二进制格式混合它们,以获得更好和更多样化的子网络。随机应用图像增强和特征增强操作,以使模型更加多样化。采用共享特征金字塔网络来减少模型参数和计算成本。实验结果证实,该方法在我们指定的铁轨缺陷任务中优于单检测架构。在收集到的 8 个缺陷类别的数据集上,我们的算法在 mAP.5 上比 YOLOv5 高出 7.4%,比 Faster R-CNN 高出 2.8%。