Shi Yongjie, Guo Jiang, Tian Jiale, Yi Tongqiang, Meng Yang, Tian Zhong
School of Power and Mechanical Engineering, Wuhan University, Wuhan 430072, China.
School of Electrical Engineering, Xi'an Jiaotong University, Xi'an 710049, China.
Sensors (Basel). 2025 Aug 22;25(17):5216. doi: 10.3390/s25175216.
Online monitoring systems continuously collect massive multivariate time-series data from converter transformers. Accurate outlier detection in these data is essential for identifying sensor faults, communication errors, and incipient equipment failures, thereby ensuring reliable condition assessment and maintenance decisions. However, the complex characteristics of transformer monitoring data-including non-Gaussian distributions from diverse operational modes, high dimensionality, and multi-scale temporal dependencies-render traditional outlier detection methods ineffective. This paper proposes a Multi-View Clustering-based Outlier Detection (MVCOD) framework that addresses these challenges through complementary data representations. The framework constructs four complementary data views-raw-differential, multi-scale temporal, density-enhanced, and manifold representations-and applies four detection algorithms (K-means, HDBSCAN, OPTICS, and Isolation Forest) to each view. An adaptive fusion mechanism dynamically weights the 16 detection results based on quality and complementarity metrics. Extensive experiments on 800 kV converter transformer operational data demonstrate that MVCOD achieves a Silhouette Coefficient of 0.68 and an Outlier Separation Score of 0.81, representing 30.8% and 35.0% improvements over the best baseline method, respectively. The framework successfully identifies 10.08% of data points as outliers with feature-level localization capabilities. This work provides an effective and interpretable solution for ensuring data quality in converter transformer monitoring systems, with potential applications to other complex industrial time-series data.
在线监测系统持续从换流变压器收集海量多变量时间序列数据。对这些数据进行准确的异常检测对于识别传感器故障、通信错误和早期设备故障至关重要,从而确保可靠的状态评估和维护决策。然而,变压器监测数据的复杂特性——包括来自不同运行模式的非高斯分布、高维度和多尺度时间依赖性——使得传统的异常检测方法失效。本文提出了一种基于多视图聚类的异常检测(MVCOD)框架,该框架通过互补的数据表示来应对这些挑战。该框架构建了四个互补的数据视图——原始差分、多尺度时间、密度增强和流形表示——并将四种检测算法(K均值、HDBSCAN、OPTICS和孤立森林)应用于每个视图。一种自适应融合机制基于质量和互补性指标对16个检测结果进行动态加权。对800 kV换流变压器运行数据进行的大量实验表明,MVCOD的轮廓系数为0.68,异常分离分数为0.81,分别比最佳基线方法提高了30.8%和35.0%。该框架成功地将10.08%的数据点识别为具有特征级定位能力的异常值。这项工作为确保换流变压器监测系统中的数据质量提供了一种有效且可解释的解决方案,并有可能应用于其他复杂的工业时间序列数据。