Upadhya K Jyothi, Lobo Ronan, Chhabra Mini Shail, Paleja Aman, Rao B Dinesh, M Geetha, Sisodia Prachi, Reddy Bolusani Akshita
Department of Computer Science and Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, Karnataka, India.
Manipal School of Information Sciences, Manipal Academy of Higher Education, Manipal, Karnataka, India.
Front Big Data. 2025 Jun 4;8:1600267. doi: 10.3389/fdata.2025.1600267. eCollection 2025.
Periodic pattern mining, a branch of data mining, is expanding to provide insight into the occurrence behavior of large volumes of data. Recently, a variety of industries, including fraud detection, telecommunications, retail marketing, research, and medical have found applications for rare association rule mining, which uncovers unusual or unexpected combinations. A limited amount of literature demonstrated how periodicity is essential in mining low-support rare patterns. In addition, attention must be placed on temporal datasets that analyze crucial information about the timing of pattern occurrences and stream datasets to manage high-speed streaming data. Several algorithms have been developed that effectively track the cyclic behavior of patterns and identify the patterns that display complete or partial periodic behavior in temporal datasets. Numerous frameworks have been created to examine the periodic behavior of streaming data. Nevertheless, such a method that focuses on the temporal information in the data stream and extracts rare partial periodic patterns has yet to be proposed. With a focus on identifying rare partial periodic patterns from temporal data streams, this paper proposes two novel sliding window-based single scan approaches called and . The findings showed that when a dense dataset is considered, for different threshold variations outperformed by about 93%. Similarly, when the sparse dataset is taken into account, exhibits a 90% boost in performance. This demonstrates that on a range of synthetic, real-world, sparse, and dense datasets for different thresholds, is significantly faster than .
周期模式挖掘作为数据挖掘的一个分支,正在不断扩展,以深入了解大量数据的出现行为。最近,包括欺诈检测、电信、零售营销、研究和医疗在内的各种行业都发现了罕见关联规则挖掘的应用,这种挖掘可以揭示不寻常或意外的组合。有限的文献表明了周期性在挖掘低支持度罕见模式中的重要性。此外,必须关注分析模式出现时间关键信息的时间数据集以及管理高速流数据的流数据集。已经开发了几种算法,它们可以有效地跟踪模式的循环行为,并识别在时间数据集中显示完整或部分周期性行为的模式。已经创建了许多框架来检查流数据的周期性行为。然而,尚未提出一种专注于数据流中的时间信息并提取罕见部分周期性模式的方法。本文着重于从时间数据流中识别罕见的部分周期性模式,提出了两种新颖的基于滑动窗口的单扫描方法,分别称为 和 。研究结果表明,当考虑密集数据集 时,对于不同的阈值变化, 比 性能高出约93%。同样,当考虑稀疏数据集 时, 性能提升了90%。这表明在一系列针对不同阈值的合成、真实世界、稀疏和密集数据集上, 比 显著更快。