Liu Kai, Lu Ningyun, Wu Feng, Zhang Ridong, Gao Furong
IEEE Trans Cybern. 2023 Oct;53(10):6465-6478. doi: 10.1109/TCYB.2022.3176475. Epub 2023 Sep 15.
The data generated by modern industrial processes often exhibit high-dimensional, nonlinear, timing, and multiscale characteristics. Presently, most of the fault diagnosis methods based on deep learning only consider the part of the characteristics of industrial data, which will cause the loss of part of the feature information during training, thereby affecting the final diagnosis effect. In order to solve the above problems, this article proposes an end-to-end multiscale feature learning method based on model fusion, which can simultaneously extract multiscale spatial features and temporal features of data, effectively reducing the loss of feature information. First, this article combines the convolutional neural network (CNN) with residual learning and designs a multiscale residual network (MRCNN) to extract high-dimensional nonlinear spatial features of different scales in the data. Then, the extracted features are input into the long and short-term memory (LSTM) network to further extract the temporal features of the data. After the fully connected layer, it is input into the classifier for final fault classification. The residual learning in MRCNN can effectively avoid the problem of model degradation and improve the training efficiency of the model. Through the fusion of MRCNN and LSTM, we can significantly improve the feature extraction ability of the model, thereby greatly improving the diagnosis effect. In the final case experiment, the method improved the comprehensive diagnostic accuracy of the Tennessee-Eastman (TE) process and industrial coking furnace datasets to 94.43% and 97.80%, respectively, which was significantly better than the existing deep learning model and proves the effectiveness and superiority of this method.
现代工业过程产生的数据往往具有高维、非线性、时序和多尺度特征。目前,大多数基于深度学习的故障诊断方法仅考虑了工业数据部分特征,这会导致训练过程中部分特征信息丢失,进而影响最终诊断效果。为解决上述问题,本文提出一种基于模型融合的端到端多尺度特征学习方法,该方法能够同时提取数据的多尺度空间特征和时序特征,有效减少特征信息损失。首先,本文将卷积神经网络(CNN)与残差学习相结合,设计了一种多尺度残差网络(MRCNN)来提取数据中不同尺度的高维非线性空间特征。然后,将提取的特征输入到长短时记忆(LSTM)网络中,进一步提取数据的时序特征。经过全连接层后,将其输入到分类器进行最终的故障分类。MRCNN中的残差学习能够有效避免模型退化问题,提高模型的训练效率。通过MRCNN与LSTM的融合,可以显著提高模型的特征提取能力,从而大大提高诊断效果。在最后的案例实验中,该方法将田纳西-伊斯曼(TE)过程和工业焦炉数据集的综合诊断准确率分别提高到了94.43%和97.80%,明显优于现有的深度学习模型,证明了该方法的有效性和优越性。