Suppr超能文献

OPP-Miner:用于时间序列的保序序贯模式挖掘。

OPP-Miner: Order-Preserving Sequential Pattern Mining for Time Series.

出版信息

IEEE Trans Cybern. 2023 May;53(5):3288-3300. doi: 10.1109/TCYB.2022.3169327. Epub 2023 Apr 21.

Abstract

Traditional sequential pattern mining methods were designed for symbolic sequence. As a collection of measurements in chronological order, a time series needs to be discretized into symbolic sequences, and then users can apply sequential pattern mining methods to discover interesting patterns in time series. The discretization will not only cause the loss of some important information, which partially destroys the continuity of time series, but also ignore the order relations between time-series values. Inspired by order-preserving matching, this article explores a new method called order-preserving sequential pattern (OPP) mining, which does not need to discretize time series into symbolic sequences and represents patterns based on the order relations of time series. An inherent advantage of such representation is that the trend of a time series can be represented by the relative order of the values underneath time series. We propose an OPP-Miner algorithm to mine frequent patterns in time series with the same relative order. OPP-Miner employs the filtration and verification strategies to calculate the support and uses the pattern fusion strategy to generate candidate patterns. To compress the result set, we also study to find the maximal OPPs. Experimental results validate that OPP-Miner is not only efficient but can also discover similar subsequences in time series. In addition, case studies show that our algorithms have high utility in analyzing the COVID-19 epidemic by identifying critical trends and improve the clustering performance. The algorithms and data can be downloaded from https://github.com/wuc567/Pattern-Mining/tree/master/OPP-Miner.

摘要

传统的序列模式挖掘方法是为符号序列设计的。作为按时间顺序排列的测量值集合,时间序列需要离散化为符号序列,然后用户可以应用序列模式挖掘方法来发现时间序列中的有趣模式。这种离散化不仅会导致一些重要信息的丢失,从而部分破坏时间序列的连续性,而且还会忽略时间序列值之间的顺序关系。受顺序保持匹配的启发,本文探索了一种新的方法,称为顺序保持序列模式(OPP)挖掘,它不需要将时间序列离散化为符号序列,而是基于时间序列的顺序关系来表示模式。这种表示方法的一个固有优势是,时间序列的趋势可以通过时间序列下数值的相对顺序来表示。我们提出了一种 OPP-Miner 算法,用于挖掘时间序列中具有相同相对顺序的频繁模式。OPP-Miner 采用过滤和验证策略来计算支持度,并使用模式融合策略生成候选模式。为了压缩结果集,我们还研究了找到最大 OPP 的方法。实验结果验证了 OPP-Miner 不仅高效,而且可以在时间序列中发现相似的子序列。此外,案例研究表明,我们的算法通过识别关键趋势,在分析 COVID-19 疫情方面具有很高的实用性,并提高了聚类性能。算法和数据可以从 https://github.com/wuc567/Pattern-Mining/tree/master/OPP-Miner 下载。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验