Dong Shengli, Xu Xinghan, Chen Yuhang, Zhang Yifang, Wang Shengzheng
Merchant Marine College, Shanghai Maritime University, Shanghai 201306, China.
Shanghai Ship and Shipping Research Institute Co., Ltd., Shanghai 200135, China.
Entropy (Basel). 2024 Sep 25;26(10):815. doi: 10.3390/e26100815.
Currently, with the increasing scale of industrial systems, multisensor monitoring data exhibit large-scale dynamic Gaussian and non-Gaussian concurrent complex characteristics. However, the traditional principal component analysis method is based on Gaussian distribution and uncorrelated assumptions, which are greatly limited in practice. Therefore, developing a new fault detection method for large-scale Gaussian and non-Gaussian concurrent dynamic systems is one of the urgent challenges to be addressed. To this end, a double-layer distributed and integrated data-driven strategy based on Laplacian score weighting and integrated Bayesian inference is proposed. Specifically, in the first layer of the distributed strategy, we design a Jarque-Bera test module to divide all multisensor monitoring variables into Gaussian and non-Gaussian blocks, successfully solving the problem of different data distributions. In the second layer of the distributed strategy, we design a dynamic augmentation module to solve dynamic problems, a K-means clustering module to mine local similarity information of variables, and a Laplace scoring module to quantitatively evaluate the structural retention ability of variables. Therefore, this double-layer distributed strategy can simultaneously combine the different distribution characteristics, dynamism, local similarity, and importance of variables, comprehensively mining the local information of the multisensor data. In addition, we develop an integrated Bayesian inference strategy based on detection performance weighting, which can emphasize the differential contribution of local models. Finally, the fault detection results for the Tennessee Eastman production system and a diesel engine working system validate the superiority of the proposed method.
当前,随着工业系统规模的不断扩大,多传感器监测数据呈现出大规模动态高斯和非高斯并发的复杂特性。然而,传统的主成分分析方法基于高斯分布和不相关假设,在实际应用中受到很大限制。因此,开发一种针对大规模高斯和非高斯并发动态系统的新型故障检测方法是亟待解决的挑战之一。为此,提出了一种基于拉普拉斯分数加权和集成贝叶斯推理的双层分布式集成数据驱动策略。具体而言,在分布式策略的第一层,我们设计了一个Jarque-Bera检验模块,将所有多传感器监测变量划分为高斯和非高斯块,成功解决了不同数据分布的问题。在分布式策略的第二层,我们设计了一个动态增强模块来解决动态问题,一个K均值聚类模块来挖掘变量的局部相似性信息,以及一个拉普拉斯评分模块来定量评估变量的结构保留能力。因此,这种双层分布式策略可以同时结合变量的不同分布特征、动态性、局部相似性和重要性,全面挖掘多传感器数据的局部信息。此外,我们开发了一种基于检测性能加权的集成贝叶斯推理策略,该策略可以强调局部模型的差异贡献。最后,对田纳西伊士曼生产系统和柴油发动机工作系统的故障检测结果验证了所提方法的优越性。