Mao Zehui, Xia Mingxuan, Jiang Bin, Xu Dezhi, Shi Peng
IEEE Trans Cybern. 2022 Aug;52(8):7624-7633. doi: 10.1109/TCYB.2020.3034929. Epub 2022 Jul 19.
Diagnosing the fault as early as possible is significant to guarantee the safety and reliability of the high-speed train. Incipient fault always makes the monitored signals deviate from their normal values, which may lead to serious consequences gradually. Due to the obscure early stage symptoms, incipient faults are difficult to detect. This article develops a stacked generalization (stacking)-based incipient fault diagnosis scheme for the traction system of high-speed trains. To extract the fault feature from the faulty data signals, which are similar to the normal ones, the extreme gradient boosting (XGBoost), random forest (RF), extra trees (ET), and light gradient boosting machine (LightGBM) are chosen as the base estimators in the first layer of the stacking. Then, the logistic regression (LR) is taken as the meta estimator in the second layer to integrate the results from the base estimators for fault classification. Thanks to the generalization ability of stacking, the incipient fault diagnosis performance of the proposed stacking-based method is better than that of the single model (XGBoost, RF, ET, and LightGBM), although they can be used to detect the incipient faults, separately. Moreover, to find out the optimal hyperparameters of the base estimators, a swarm intelligent optimization algorithm, pigeon-inspired optimization (PIO), is employed. The proposed method is tested on a semiphysical platform of the CRH2 traction system in CRRC Zhuzhou Locomotive Company Ltd. The results show that the fault diagnosis rate of the proposed scheme is over 96%.
尽早诊断故障对于保证高速列车的安全性和可靠性具有重要意义。早期故障总是会使监测信号偏离其正常值,这可能会逐渐导致严重后果。由于早期症状不明显,早期故障难以检测。本文针对高速列车牵引系统开发了一种基于堆叠泛化(stacking)的早期故障诊断方案。为了从与正常数据信号相似的故障数据信号中提取故障特征,选择极端梯度提升(XGBoost)、随机森林(RF)、Extra Trees(ET)和轻量级梯度提升机(LightGBM)作为堆叠第一层的基估计器。然后,将逻辑回归(LR)作为第二层的元估计器,对基估计器的结果进行整合以进行故障分类。由于堆叠的泛化能力,尽管所提出的基于堆叠的方法中的单个模型(XGBoost、RF、ET和LightGBM)都可以单独用于检测早期故障,但其早期故障诊断性能优于单个模型。此外,为了找出基估计器的最优超参数,采用了一种群体智能优化算法——鸽群启发优化(PIO)。所提出的方法在中车株洲电力机车有限公司的CRH2牵引系统半物理平台上进行了测试。结果表明,所提方案的故障诊断率超过96%。