García-Méndez Silvia, de Arriba-Pérez Francisco, Leal Fátima, Veloso Bruno, Malheiro Benedita, Burguillo-Rial Juan Carlos
Information Technologies Group, atlanTTic, University of Vigo, Vigo, Spain.
REMIT, Universidade Portucalense, Porto, Portugal.
Sci Rep. 2025 Jul 28;15(1):27495. doi: 10.1038/s41598-025-08084-1.
The public transportation sector generates large volumes of sensor data that, if analyzed adequately, can help anticipate failures and initiate maintenance actions, thereby enhancing quality and productivity. This work contributes to a real-time data-driven predictive maintenance solution for Intelligent Transportation Systems. The proposed method implements a processing pipeline comprised of sample pre-processing, incremental classification with Machine Learning models, and outcome explanation. This novel online processing pipeline has two main highlights: (i) a dedicated sample pre-processing module, which builds statistical and frequency-related features on the fly, and (ii) an explainability module. This work is the first to perform online fault prediction with natural language and visual explainability. The experiments were performed with the MetroPT data set from the metro operator of Porto, Portugal. The results are above 98 % for F-measure and 99 % for accuracy. In the context of railway predictive maintenance, achieving these high values is crucial due to the practical and operational implications of accurate failure prediction. In the specific case of a high F-measure, this ensures that the system maintains an optimal balance between detecting the highest possible number of real faults and minimizing false alarms, which is crucial for maximizing service availability. Furthermore, the accuracy obtained enables reliability, directly impacting cost reduction and increased safety. The analysis demonstrates that the pipeline maintains high performance even in the presence of class imbalance and noise, and its explanations effectively reflect the decision-making process. These findings validate the methodological soundness of the approach and confirm its practical applicability for supporting proactive maintenance decisions in real-world railway operations. Therefore, by identifying the early signs of failure, this pipeline enables decision-makers to understand the underlying problems and act accordingly swiftly.
公共交通部门产生大量传感器数据,如果对这些数据进行充分分析,有助于预测故障并启动维护行动,从而提高质量和生产率。这项工作为智能交通系统的实时数据驱动预测性维护解决方案做出了贡献。所提出的方法实现了一个处理管道,该管道由样本预处理、使用机器学习模型进行增量分类以及结果解释组成。这个新颖的在线处理管道有两个主要亮点:(i)一个专用的样本预处理模块,它可以即时构建统计和频率相关特征;(ii)一个可解释性模块。这项工作首次进行了具有自然语言和视觉可解释性的在线故障预测。实验使用了来自葡萄牙波尔图地铁运营商的MetroPT数据集。F值结果高于98%,准确率高于99%。在铁路预测性维护的背景下,由于准确的故障预测具有实际和运营意义,实现这些高值至关重要。在高F值的具体情况下,这确保系统在检测尽可能多的实际故障和最小化误报之间保持最佳平衡,这对于最大化服务可用性至关重要。此外,所获得的准确率确保了可靠性,直接影响成本降低和安全性提高。分析表明,即使存在类别不平衡和噪声,该管道仍能保持高性能,并且其解释有效地反映了决策过程。这些发现验证了该方法的方法合理性,并证实了其在支持实际铁路运营中的主动维护决策方面的实际适用性。因此,通过识别故障的早期迹象,这个管道使决策者能够理解潜在问题并迅速采取相应行动。