Duan Feng, Zhang Shuai, Yan Yinze, Cai Zhiqiang
School of Mechanical Engineering, Northwestern Polytechnical University, Xi'an 710072, China.
Sensors (Basel). 2022 Jul 10;22(14):5166. doi: 10.3390/s22145166.
With the development of machine learning, data-driven mechanical fault diagnosis methods have been widely used in the field of PHM. Due to the limitation of the amount of fault data, it is a difficult problem for fault diagnosis to solve the problem of unbalanced data sets. Under unbalanced data sets, faults with little historical data are always difficult to diagnose and lead to economic losses. In order to improve the prediction accuracy under unbalanced data sets, this paper proposes MeanRadius-SMOTE based on the traditional SMOTE oversampling algorithm, which effectively avoids the generation of useless samples and noise samples. This paper validates the effectiveness of the algorithm on three linear unbalanced data sets and four step unbalanced data sets. Experimental results show that MeanRadius-SMOTE outperforms SMOTE and LR-SMOTE in various evaluation indicators, as well as has better robustness against different imbalance rates. In addition, MeanRadius-SMOTE can take into account the prediction accuracy of the overall and minority class, which is of great significance for engineering applications.
随着机器学习的发展,数据驱动的机械故障诊断方法已在故障预测与健康管理(PHM)领域得到广泛应用。由于故障数据量的限制,解决数据集不平衡问题是故障诊断中的一个难题。在不平衡数据集下,历史数据少的故障总是难以诊断,并会导致经济损失。为了提高不平衡数据集下的预测精度,本文基于传统的SMOTE过采样算法提出了MeanRadius-SMOTE算法,该算法有效避免了无用样本和噪声样本的产生。本文在三个线性不平衡数据集和四个阶跃不平衡数据集上验证了该算法的有效性。实验结果表明,MeanRadius-SMOTE在各项评估指标上均优于SMOTE和LR-SMOTE,并且对不同的不平衡率具有更好的鲁棒性。此外,MeanRadius-SMOTE能够兼顾总体和少数类的预测精度,这对工程应用具有重要意义。