Liu Siqi, Wright Adam, Hauskrecht Milos
Department of Computer Science, University of Pittsburgh.
Brigham and Women's Hospital and Harvard Medical School.
Proc Int Fla AI Res Soc Conf. 2017 May;2017:86-91.
The objective of this work is to develop methods for detecting outliers in time series data. Such methods can become the key component of various monitoring and alerting systems, where an outlier may be equal to some adverse condition that needs human attention. However, real-world time series are often affected by various sources of variability present in the environment that may influence the quality of detection; they may (1) explain some of the changes in the signal that would otherwise lead to false positive detections, as well as, (2) reduce the sensitivity of the detection algorithm leading to increase in false negatives. To alleviate these problems, we propose a new two-layer outlier detection approach that first tries to model and account for the nonstationarity and periodic variation in the time series, and then tries to use other observable variables in the environment to explain any additional signal variation. Our experiments on several data sets in different domains show that our method provides more accurate modeling of the time series, and that it is able to significantly improve outlier detection performance.
这项工作的目标是开发用于检测时间序列数据中异常值的方法。此类方法可成为各种监测和警报系统的关键组成部分,在这些系统中,异常值可能等同于某些需要人工关注的不利状况。然而,现实世界中的时间序列常常受到环境中各种变异性来源的影响,这些变异性可能会影响检测质量;它们可能(1)解释信号中的一些变化,否则这些变化会导致误报,以及(2)降低检测算法的灵敏度,导致漏报增加。为了缓解这些问题,我们提出了一种新的两层异常值检测方法,该方法首先尝试对时间序列中的非平稳性和周期性变化进行建模并加以考虑,然后尝试使用环境中的其他可观测变量来解释任何额外的信号变化。我们在不同领域的多个数据集上进行的实验表明,我们的方法能够对时间序列进行更准确的建模,并且能够显著提高异常值检测性能。