• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有多个最小支持度的关联规则挖掘:一种新的挖掘算法和支持度调整机制。

Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism.

作者信息

Hu Ya-Han, Chen Yen-Liang

机构信息

Department of Information Management, National Central University, Chung-Li 320, Taiwan, ROC.

出版信息

Decis Support Syst. 2006 Oct;42(1):1-24. doi: 10.1016/j.dss.2004.09.007. Epub 2004 Nov 30.

DOI:10.1016/j.dss.2004.09.007
PMID:32287563
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7127670/
Abstract

Mining association rules with multiple minimum supports is an important generalization of the association-rule-mining problem, which was recently proposed by Liu et al. Instead of setting a single minimum support threshold for all items, they allow users to specify multiple minimum supports to reflect the natures of the items, and an Apriori-based algorithm, named MSapriori, is developed to mine all frequent itemsets. In this paper, we study the same problem but with two additional improvements. First, we propose a FP-tree-like structure, MIS-tree, to store the crucial information about frequent patterns. Accordingly, an efficient MIS-tree-based algorithm, called the CFP-growth algorithm, is developed for mining all frequent itemsets. Second, since each item can have its own minimum support, it is very difficult for users to set the appropriate thresholds for all items at a time. In practice, users need to tune items' supports and run the mining algorithm repeatedly until a satisfactory end is reached. To speed up this time-consuming tuning process, an efficient algorithm which can maintain the MIS-tree structure without rescanning database is proposed. Experiments on both synthetic and real-life datasets show that our algorithms are much more efficient and scalable than the previous algorithm.

摘要

挖掘具有多个最小支持度的关联规则是关联规则挖掘问题的一个重要推广,这是Liu等人最近提出的。他们不是为所有项设置单个最小支持度阈值,而是允许用户指定多个最小支持度以反映项的特性,并开发了一种基于Apriori的算法MSapriori来挖掘所有频繁项集。在本文中,我们研究相同的问题,但有两个额外的改进。首先,我们提出一种类似FP树的结构MIS树,用于存储关于频繁模式的关键信息。相应地,开发了一种高效的基于MIS树的算法CFP-growth算法来挖掘所有频繁项集。其次,由于每个项可以有自己的最小支持度,用户很难一次为所有项设置合适的阈值。在实践中,用户需要调整项的支持度并反复运行挖掘算法,直到达到满意的结果。为了加速这个耗时的调整过程,提出了一种高效算法,该算法可以在不重新扫描数据库的情况下维护MIS树结构。在合成数据集和真实数据集上的实验表明,我们的算法比以前的算法更高效且更具可扩展性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/834e676a5bc1/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/9db9f3fa5489/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/3418481bb469/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/af352d82b458/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/8307ec96217a/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/7d2bfff344ff/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/74cad178c9bc/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/aadbc6fe581f/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/4608f3c95c83/gr8af.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/834e676a5bc1/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/9db9f3fa5489/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/3418481bb469/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/af352d82b458/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/8307ec96217a/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/7d2bfff344ff/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/74cad178c9bc/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/aadbc6fe581f/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/4608f3c95c83/gr8af.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e582/7127670/834e676a5bc1/gr9.jpg

相似文献

1
Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism.具有多个最小支持度的关联规则挖掘:一种新的挖掘算法和支持度调整机制。
Decis Support Syst. 2006 Oct;42(1):1-24. doi: 10.1016/j.dss.2004.09.007. Epub 2004 Nov 30.
2
An efficient pattern growth approach for mining fault tolerant frequent itemsets.一种用于挖掘容错频繁项集的高效模式增长方法。
Expert Syst Appl. 2020 Apr 1;143:113046. doi: 10.1016/j.eswa.2019.113046. Epub 2019 Oct 21.
3
The Mining Algorithm of Maximum Frequent Itemsets Based on Frequent Pattern Tree.基于频繁模式树的最大频繁项集挖掘算法。
Comput Intell Neurosci. 2022 May 18;2022:7022168. doi: 10.1155/2022/7022168. eCollection 2022.
4
Efficient Top-K Identical Frequent Itemsets Mining without Support Threshold Parameter from Transactional Datasets Produced by IoT-Based Smart Shopping Carts.从基于物联网的智能购物车生成的事务性数据集高效挖掘无支持阈值参数的 Top-K 相同频繁项集。
Sensors (Basel). 2022 Oct 21;22(20):8063. doi: 10.3390/s22208063.
5
PARM--an efficient algorithm to mine association rules from spatial data.PARM——一种从空间数据中挖掘关联规则的高效算法。
IEEE Trans Syst Man Cybern B Cybern. 2008 Dec;38(6):1513-24. doi: 10.1109/TSMCB.2008.927730.
6
DTFP-Growth: Dynamic Threshold-Based FP-Growth Rule Mining Algorithm Through Integrating Gene Expression, Methylation, and Protein-Protein Interaction Profiles.DTFP-Growth:通过整合基因表达、甲基化和蛋白质-蛋白质相互作用谱的基于动态阈值的 FP 增长规则挖掘算法。
IEEE Trans Nanobioscience. 2018 Apr;17(2):117-125. doi: 10.1109/TNB.2018.2803021.
7
Efficient Algorithm for Mining Non-Redundant High-Utility Association Rules.高效挖掘非冗余高效用关联规则的算法。
Sensors (Basel). 2020 Feb 17;20(4):1078. doi: 10.3390/s20041078.
8
RANWAR: rank-based weighted association rule mining from gene expression and methylation data.RANWAR:从基因表达和甲基化数据中进行基于秩的加权关联规则挖掘。
IEEE Trans Nanobioscience. 2015 Jan;14(1):59-66. doi: 10.1109/TNB.2014.2359494. Epub 2014 Sep 23.
9
An incremental high-utility mining algorithm with transaction insertion.一种带有事务插入的增量式高实用性挖掘算法。
ScientificWorldJournal. 2015;2015:161564. doi: 10.1155/2015/161564. Epub 2015 Feb 25.
10
Confabulation-inspired association rule mining for rare and frequent itemsets.基于虚构联想的关联规则挖掘稀有和频繁项集。
IEEE Trans Neural Netw Learn Syst. 2014 Nov;25(11):2053-64. doi: 10.1109/TNNLS.2014.2303137.

引用本文的文献

1
Molecular Aspects of Geriatric Pharmacotherapy.老年药物治疗的分子层面
Cells. 2025 Sep 1;14(17):1363. doi: 10.3390/cells14171363.
2
Extraction frequent patterns in trauma dataset based on automatic generation of minimum support and feature weighting.基于最小支持自动生成和特征加权的创伤数据集频繁模式提取。
BMC Med Res Methodol. 2024 Feb 16;24(1):40. doi: 10.1186/s12874-024-02154-0.
3
Analysis of Acupoints Combination for Cancer-Related Anorexia Based on Association Rule Mining.基于关联规则挖掘的癌症相关性厌食症穴位配伍分析
Evid Based Complement Alternat Med. 2022 Oct 18;2022:4251458. doi: 10.1155/2022/4251458. eCollection 2022.
4
A novel association rule mining approach using TID intermediate itemset.一种使用事务标识(TID)中间项集的新型关联规则挖掘方法。
PLoS One. 2018 Jan 19;13(1):e0179703. doi: 10.1371/journal.pone.0179703. eCollection 2018.
5
Assessing Lightning and Wildfire Hazard by Land Properties and Cloud to Ground Lightning Data with Association Rule Mining in Alberta, Canada.利用加拿大艾伯塔省的土地属性和云对地闪电数据以及关联规则挖掘评估闪电和野火危害
Sensors (Basel). 2017 Oct 23;17(10):2413. doi: 10.3390/s17102413.
6
Building an associative classifier with multiple minimum supports.构建具有多个最小支持度的关联分类器。
Springerplus. 2016 Apr 26;5:528. doi: 10.1186/s40064-016-2153-1. eCollection 2016.