Department of Computer Engineering, Chung-Ang University, Dongjak-gu, Seoul, Republic of Korea.
PLoS One. 2021 Feb 18;16(2):e0247119. doi: 10.1371/journal.pone.0247119. eCollection 2021.
Existing dynamic graph embedding-based outlier detection methods mainly focus on the evolution of graphs and ignore the similarities among them. To overcome this limitation for the effective detection of abnormal climatic events from meteorological time series, we proposed a dynamic graph embedding model based on graph proximity, called DynGPE. Climatic events are represented as a graph where each vertex indicates meteorological data and each edge indicates a spurious relationship between two meteorological time series that are not causally related. The graph proximity is described as the distance between two graphs. DynGPE can cluster similar climatic events in the embedding space. Abnormal climatic events are distant from most of the other events and can be detected using outlier detection methods. We conducted experiments by applying three outlier detection methods (i.e., isolation forest, local outlier factor, and box plot) to real meteorological data. The results showed that DynGPE achieves better results than the baseline by 44.3% on average in terms of the F-measure. Isolation forest provides the best performance and stability. It achieved higher results than the local outlier factor and box plot methods, namely, by 15.4% and 78.9% on average, respectively.
现有的基于动态图嵌入的异常检测方法主要关注图的演变,而忽略了它们之间的相似性。为了克服这一限制,有效地从气象时间序列中检测异常气候事件,我们提出了一种基于图相似度的动态图嵌入模型,称为 DynGPE。气候事件表示为一个图,其中每个顶点表示气象数据,每条边表示两个气象时间序列之间的虚假关系,这两个时间序列没有因果关系。图相似度描述为两个图之间的距离。DynGPE 可以在嵌入空间中对相似的气候事件进行聚类。异常气候事件与大多数其他事件相距较远,可以使用异常检测方法进行检测。我们通过将三种异常检测方法(即孤立森林、局部离群因子和箱线图)应用于真实气象数据来进行实验。结果表明,在 F 度量方面,DynGPE 平均比基线提高了 44.3%。孤立森林提供了最佳的性能和稳定性。它的结果比局部离群因子和箱线图方法分别高出 15.4%和 78.9%。