Namoano Bernadin, Latsou Christina, Erkoyuncu John Ahmet
Centre of Digital Engineering and Manufacturing, Cranfield University, College Rd, Wharley End, Bedford, MK43 0AL UK.
J Intell Manuf. 2025;36(6):4319-4330. doi: 10.1007/s10845-024-02447-7. Epub 2024 Jul 13.
Anomaly detection in multivariate time-series data is critical for monitoring asset conditions, enabling prompt fault detection and diagnosis to mitigate damage, reduce downtime and enhance safety. Existing literature predominately emphasises temporal dependencies in single-channel data, often overlooking interrelations between features in multivariate time-series data and across multiple channels. This paper introduces G-BOCPD, a novel graphical model-based annotation method designed to automatically detect anomalies in multi-channel multivariate time-series data. To address internal and external dependencies, G-BOCPD proposes a hybridisation of the graphical lasso and expectation maximisation algorithms. This approach detects anomalies in multi-channel multivariate time-series by identifying segments with diverse behaviours and patterns, which are then annotated to highlight variations. The method alternates between estimating the concentration matrix, which represents dependencies between variables, using the graphical lasso algorithm, and annotating segments through a minimal path clustering method for a comprehensive understanding of variations. To demonstrate its effectiveness, G-BOCPD is applied to multichannel time-series obtained from: (i) Diesel Multiple Unit train engines exhibiting faulty behaviours; and (ii) a group of train doors at various degradation stages. Empirical evidence highlights G-BOCPD's superior performance compared to previous approaches in terms of precision, recall and F1-score.
多变量时间序列数据中的异常检测对于监测资产状况至关重要,能够实现及时的故障检测与诊断,以减轻损害、减少停机时间并提高安全性。现有文献主要强调单通道数据中的时间依赖性,常常忽略多变量时间序列数据中各特征之间以及多个通道之间的相互关系。本文介绍了G-BOCPD,这是一种基于图形模型的新型注释方法,旨在自动检测多通道多变量时间序列数据中的异常。为了解决内部和外部依赖性问题,G-BOCPD提出了图形套索算法和期望最大化算法的混合方法。该方法通过识别具有不同行为和模式的片段来检测多通道多变量时间序列中的异常,然后对这些片段进行注释以突出变化。该方法在使用图形套索算法估计表示变量之间依赖性的浓度矩阵和通过最小路径聚类方法注释片段之间交替进行,以便全面了解变化情况。为了证明其有效性,G-BOCPD应用于从以下方面获得的多通道时间序列:(i)表现出故障行为的柴油动车组列车发动机;以及(ii)处于不同退化阶段的一组列车车门。实证证据表明,与先前方法相比,G-BOCPD在精度、召回率和F1分数方面具有卓越的性能。