• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用事务标识(TID)中间项集的新型关联规则挖掘方法。

A novel association rule mining approach using TID intermediate itemset.

作者信息

Aqra Iyad, Herawan Tutut, Abdul Ghani Norjihan, Akhunzada Adnan, Ali Akhtar, Bin Razali Ramdan, Ilahi Manzoor, Raymond Choo Kim-Kwang

机构信息

Department of Information Systems, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia

Department of Computer Science, COMSATS Institute of Information Technology (CIIT), Islamabad, Pakistan.

出版信息

PLoS One. 2018 Jan 19;13(1):e0179703. doi: 10.1371/journal.pone.0179703. eCollection 2018.

DOI:10.1371/journal.pone.0179703
PMID:29351287
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5774682/
Abstract

Designing an efficient association rule mining (ARM) algorithm for multilevel knowledge-based transactional databases that is appropriate for real-world deployments is of paramount concern. However, dynamic decision making that needs to modify the threshold either to minimize or maximize the output knowledge certainly necessitates the extant state-of-the-art algorithms to rescan the entire database. Subsequently, the process incurs heavy computation cost and is not feasible for real-time applications. The paper addresses efficiently the problem of threshold dynamic updation for a given purpose. The paper contributes by presenting a novel ARM approach that creates an intermediate itemset and applies a threshold to extract categorical frequent itemsets with diverse threshold values. Thus, improving the overall efficiency as we no longer needs to scan the whole database. After the entire itemset is built, we are able to obtain real support without the need of rebuilding the itemset (e.g. Itemset list is intersected to obtain the actual support). Moreover, the algorithm supports to extract many frequent itemsets according to a pre-determined minimum support with an independent purpose. Additionally, the experimental results of our proposed approach demonstrate the capability to be deployed in any mining system in a fully parallel mode; consequently, increasing the efficiency of the real-time association rules discovery process. The proposed approach outperforms the extant state-of-the-art and shows promising results that reduce computation cost, increase accuracy, and produce all possible itemsets.

摘要

为适用于实际部署的基于多级知识的事务数据库设计一种高效的关联规则挖掘(ARM)算法至关重要。然而,动态决策需要修改阈值以最小化或最大化输出知识,这必然要求现有的最先进算法重新扫描整个数据库。随后,该过程会产生高昂的计算成本,对于实时应用来说并不可行。本文有效地解决了给定目的下阈值动态更新的问题。本文提出了一种新颖的ARM方法,该方法创建中间项集并应用阈值来提取具有不同阈值的分类频繁项集,从而做出了贡献。这样一来,由于不再需要扫描整个数据库,整体效率得到了提高。在构建完整个项集后,我们无需重建项集就能获得实际支持度(例如,通过交集项集列表来获得实际支持度)。此外,该算法支持根据预先确定的最小支持度独立地提取多个频繁项集。此外,我们提出的方法的实验结果表明它能够以完全并行模式部署在任何挖掘系统中;因此,提高了实时关联规则发现过程的效率。所提出的方法优于现有的最先进方法,并显示出降低计算成本、提高准确性和生成所有可能项集的良好结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/bb21dbae5670/pone.0179703.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/5757dd6ef58d/pone.0179703.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/d195bd28792c/pone.0179703.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/c9b462bf1849/pone.0179703.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/f44a46d3ee04/pone.0179703.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/143948a9231b/pone.0179703.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/0f99f97c94a3/pone.0179703.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/e910580d1aad/pone.0179703.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/7565786552b7/pone.0179703.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/3de90c2cdf24/pone.0179703.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/1caee74390b7/pone.0179703.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/66dc67d1a751/pone.0179703.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/6204bff10bae/pone.0179703.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/195ebbd341bc/pone.0179703.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/e5c7f8c8967d/pone.0179703.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/bb21dbae5670/pone.0179703.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/5757dd6ef58d/pone.0179703.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/d195bd28792c/pone.0179703.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/c9b462bf1849/pone.0179703.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/f44a46d3ee04/pone.0179703.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/143948a9231b/pone.0179703.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/0f99f97c94a3/pone.0179703.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/e910580d1aad/pone.0179703.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/7565786552b7/pone.0179703.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/3de90c2cdf24/pone.0179703.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/1caee74390b7/pone.0179703.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/66dc67d1a751/pone.0179703.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/6204bff10bae/pone.0179703.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/195ebbd341bc/pone.0179703.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/e5c7f8c8967d/pone.0179703.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896f/5774682/bb21dbae5670/pone.0179703.g015.jpg

相似文献

1
A novel association rule mining approach using TID intermediate itemset.一种使用事务标识(TID)中间项集的新型关联规则挖掘方法。
PLoS One. 2018 Jan 19;13(1):e0179703. doi: 10.1371/journal.pone.0179703. eCollection 2018.
2
The Mining Algorithm of Maximum Frequent Itemsets Based on Frequent Pattern Tree.基于频繁模式树的最大频繁项集挖掘算法。
Comput Intell Neurosci. 2022 May 18;2022:7022168. doi: 10.1155/2022/7022168. eCollection 2022.
3
Marginal frequent itemset mining for fault prevention of railway overhead contact system.用于铁路架空接触网系统故障预防的边际频繁项集挖掘
ISA Trans. 2022 Jul;126:276-287. doi: 10.1016/j.isatra.2021.07.018. Epub 2021 Jul 13.
4
An efficient pattern growth approach for mining fault tolerant frequent itemsets.一种用于挖掘容错频繁项集的高效模式增长方法。
Expert Syst Appl. 2020 Apr 1;143:113046. doi: 10.1016/j.eswa.2019.113046. Epub 2019 Oct 21.
5
Quantifying the informativeness for biomedical literature summarization: An itemset mining method.量化生物医学文献摘要的信息量:一种基于项集挖掘的方法。
Comput Methods Programs Biomed. 2017 Jul;146:77-89. doi: 10.1016/j.cmpb.2017.05.011. Epub 2017 May 27.
6
Efficient Top-K Identical Frequent Itemsets Mining without Support Threshold Parameter from Transactional Datasets Produced by IoT-Based Smart Shopping Carts.从基于物联网的智能购物车生成的事务性数据集高效挖掘无支持阈值参数的 Top-K 相同频繁项集。
Sensors (Basel). 2022 Oct 21;22(20):8063. doi: 10.3390/s22208063.
7
Diagnosis of coronary artery disease using an efficient hash table based closed frequent itemsets mining.使用基于高效哈希表的封闭频繁项集挖掘技术诊断冠状动脉疾病。
Med Biol Eng Comput. 2018 May;56(5):749-759. doi: 10.1007/s11517-017-1719-6. Epub 2017 Sep 14.
8
Negative and positive association rules mining from text using frequent and infrequent itemsets.基于频繁和非频繁项集从文本中挖掘正负关联规则。
ScientificWorldJournal. 2014;2014:973750. doi: 10.1155/2014/973750. Epub 2014 May 18.
9
Personalized privacy-preserving frequent itemset mining using randomized response.使用随机应答的个性化隐私保护频繁项集挖掘
ScientificWorldJournal. 2014;2014:686151. doi: 10.1155/2014/686151. Epub 2014 Mar 30.
10
Parallel and distributed methods for incremental frequent itemset mining.用于增量频繁项集挖掘的并行和分布式方法。
IEEE Trans Syst Man Cybern B Cybern. 2004 Dec;34(6):2439-50. doi: 10.1109/tsmcb.2004.836887.

引用本文的文献

1
The influence of machine learning-based knowledge management model on enterprise organizational capability innovation and industrial development.基于机器学习的知识管理模型对企业组织能力创新和产业发展的影响。
PLoS One. 2020 Dec 1;15(12):e0242253. doi: 10.1371/journal.pone.0242253. eCollection 2020.
2
A Theoretical Approach for Correlating Proteins to Malignant Diseases.一种将蛋白质与恶性疾病相关联的理论方法。
Front Mol Biosci. 2020 Oct 22;7:582593. doi: 10.3389/fmolb.2020.582593. eCollection 2020.
3
Correction: A novel association rule mining approach using TID intermediate itemset.

本文引用的文献

1
Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism.具有多个最小支持度的关联规则挖掘:一种新的挖掘算法和支持度调整机制。
Decis Support Syst. 2006 Oct;42(1):1-24. doi: 10.1016/j.dss.2004.09.007. Epub 2004 Nov 30.
更正:一种使用事务标识符(TID)中间项集的新型关联规则挖掘方法。
PLoS One. 2018 May 1;13(5):e0196957. doi: 10.1371/journal.pone.0196957. eCollection 2018.