Guan Siwei, He Zhiwei, Ma Shenhui, Gao Mingyu
School of Electronic Information, Hangzhou Dianzi University, Hangzhou, Zhejiang Province, China; Zhejiang Provincial Key Lab of Equipment Electronics, Hangzhou, Zhejiang Province, China.
ISA Trans. 2023 Dec;143:231-243. doi: 10.1016/j.isatra.2023.09.002. Epub 2023 Sep 5.
Multivariate time series data is becoming increasingly ubiquitous in various fields such as servers, industrial applications, and healthcare. However, detecting anomalies in such data is challenging due to its complex time-dependent, high-dimensional, and label scarcity. Aiming at this problem, this paper proposes an Attention Factorization Normalizing Flow (AFNF) algorithm for unsupervised multivariate time series anomaly detection. Our hypothesis is that anomalies are in a low-density region of the distribution. To transform the complex density of high-dimensional time series into a simple evaluable conditional density, we propose a time series factorization strategy and parameterize the conditional information generated by factorization in the time and attribute dimensions using an attention mechanism. Moreover, to compensate for the lack of temporal information due to the permutation invariance attention mechanism, a adjacency contrasting approach is proposed to model the local invariance of the time series. To provide long-term location information, a learnable global location encoding is introduced. Conditional normalizing flows are applied to evaluate the conditional probability of the observations. Finally, through extensive experiments on three real data sets, our method yielded the best results and its effectiveness in density estimation and anomaly detection is demonstrated.
多元时间序列数据在服务器、工业应用和医疗保健等各个领域中变得越来越普遍。然而,由于此类数据复杂的时间依赖性、高维度性和标签稀缺性,检测其中的异常具有挑战性。针对这一问题,本文提出了一种用于无监督多元时间序列异常检测的注意力因子归一化流(AFNF)算法。我们的假设是异常位于分布的低密度区域。为了将高维时间序列的复杂密度转换为简单可评估的条件密度,我们提出了一种时间序列分解策略,并使用注意力机制在时间和属性维度上对分解产生的条件信息进行参数化。此外,为了弥补由于排列不变注意力机制导致的时间信息不足,提出了一种邻接对比方法来建模时间序列的局部不变性。为了提供长期位置信息,引入了一种可学习的全局位置编码。条件归一化流用于评估观测值的条件概率。最后,通过在三个真实数据集上进行的大量实验,我们的方法取得了最佳结果,并证明了其在密度估计和异常检测方面的有效性。