• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

不确定数据流中频繁模式的超结构挖掘

Hyper-structure mining of frequent patterns in uncertain data streams.

作者信息

Hewanadungodage Chandima, Xia Yuni, Lee Jaehwan John, Tu Yi-Cheng

机构信息

Department of Computer and Information Science, Indiana University-Purdue University Indianapolis, 723 West Michigan Street, Indianapolis, IN 46202-5132, USA.

Department of Electrical and Computer Engineering, Indiana University-Purdue University Indianapolis, 723 West Michigan Street, Indianapolis, IN 46202-5132, USA.

出版信息

Knowl Inf Syst. 2013 Oct 1;37(1):219-244. doi: 10.1007/s10115-012-0581-y.

DOI:10.1007/s10115-012-0581-y
PMID:24729652
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3983695/
Abstract

Data uncertainty is inherent in many real-world applications such as sensor monitoring systems, location-based services, and medical diagnostic systems. Moreover, many real-world applications are now capable of producing continuous, unbounded data streams. During the recent years, new methods have been developed to find frequent patterns in uncertain databases; nevertheless, very limited work has been done in discovering frequent patterns in uncertain data streams. The current solutions for frequent pattern mining in uncertain streams take a FP-tree-based approach; however, recent studies have shown that FP-tree-based algorithms do not perform well in the presence of data uncertainty. In this paper, we propose two hyper-structure-based false-positive-oriented algorithms to efficiently mine frequent itemsets from streams of uncertain data. The first algorithm, UHS-Stream, is designed to find all frequent itemsets up to the current moment. The second algorithm, TFUHS-Stream, is designed to find frequent itemsets in an uncertain data stream in a time-fading manner. Experimental results show that the proposed hyper-structure-based algorithms outperform the existing tree-based algorithms in terms of accuracy, runtime, and memory usage.

摘要

数据不确定性在许多现实世界的应用中都存在,如传感器监测系统、基于位置的服务和医疗诊断系统。此外,许多现实世界的应用现在能够产生连续的、无界的数据流。近年来,已经开发出了新的方法来在不确定数据库中发现频繁模式;然而,在不确定数据流中发现频繁模式的工作却非常有限。当前用于不确定流中频繁模式挖掘的解决方案采用基于FP树的方法;然而,最近的研究表明,基于FP树的算法在存在数据不确定性的情况下表现不佳。在本文中,我们提出了两种基于超结构的面向误报的算法,以有效地从不确定数据流中挖掘频繁项集。第一种算法UHS-Stream旨在找到截至当前时刻的所有频繁项集。第二种算法TFUHS-Stream旨在以时间衰减的方式在不确定数据流中找到频繁项集。实验结果表明,所提出的基于超结构的算法在准确性、运行时间和内存使用方面优于现有的基于树的算法。

相似文献

1
Hyper-structure mining of frequent patterns in uncertain data streams.不确定数据流中频繁模式的超结构挖掘
Knowl Inf Syst. 2013 Oct 1;37(1):219-244. doi: 10.1007/s10115-012-0581-y.
2
The Mining Algorithm of Maximum Frequent Itemsets Based on Frequent Pattern Tree.基于频繁模式树的最大频繁项集挖掘算法。
Comput Intell Neurosci. 2022 May 18;2022:7022168. doi: 10.1155/2022/7022168. eCollection 2022.
3
An efficient pattern growth approach for mining fault tolerant frequent itemsets.一种用于挖掘容错频繁项集的高效模式增长方法。
Expert Syst Appl. 2020 Apr 1;143:113046. doi: 10.1016/j.eswa.2019.113046. Epub 2019 Oct 21.
4
LCTree-Based Approach for Mining Frequent Items in Real-Time.基于 LCTree 的实时频繁项挖掘方法。
Comput Intell Neurosci. 2022 Oct 14;2022:7430106. doi: 10.1155/2022/7430106. eCollection 2022.
5
Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism.具有多个最小支持度的关联规则挖掘:一种新的挖掘算法和支持度调整机制。
Decis Support Syst. 2006 Oct;42(1):1-24. doi: 10.1016/j.dss.2004.09.007. Epub 2004 Nov 30.
6
Mining of high utility-probability sequential patterns from uncertain databases.从不确定数据库中挖掘高效用概率序列模式。
PLoS One. 2017 Jul 25;12(7):e0180931. doi: 10.1371/journal.pone.0180931. eCollection 2017.
7
Negative and positive association rules mining from text using frequent and infrequent itemsets.基于频繁和非频繁项集从文本中挖掘正负关联规则。
ScientificWorldJournal. 2014;2014:973750. doi: 10.1155/2014/973750. Epub 2014 May 18.
8
Efficient Top-K Identical Frequent Itemsets Mining without Support Threshold Parameter from Transactional Datasets Produced by IoT-Based Smart Shopping Carts.从基于物联网的智能购物车生成的事务性数据集高效挖掘无支持阈值参数的 Top-K 相同频繁项集。
Sensors (Basel). 2022 Oct 21;22(20):8063. doi: 10.3390/s22208063.
9
An incremental high-utility mining algorithm with transaction insertion.一种带有事务插入的增量式高实用性挖掘算法。
ScientificWorldJournal. 2015;2015:161564. doi: 10.1155/2015/161564. Epub 2015 Feb 25.
10
Diagnosis of coronary artery disease using an efficient hash table based closed frequent itemsets mining.使用基于高效哈希表的封闭频繁项集挖掘技术诊断冠状动脉疾病。
Med Biol Eng Comput. 2018 May;56(5):749-759. doi: 10.1007/s11517-017-1719-6. Epub 2017 Sep 14.