• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HUIL-TN 和 HUI-TN:基于模式增长的高实用项集挖掘。

HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth.

机构信息

College of Digital Technology and Engineering, Ningbo University of Finance and Economics, Ningbo, Zhejiang, China.

出版信息

PLoS One. 2021 Mar 12;16(3):e0248349. doi: 10.1371/journal.pone.0248349. eCollection 2021.

DOI:10.1371/journal.pone.0248349
PMID:33711048
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7954358/
Abstract

In recent years, high utility itemsets (HUIs) mining has been an active research topic in data mining. In this study, we propose two efficient pattern-growth based HUI mining algorithms, called High Utility Itemset based on Length and Tail-Node tree (HUIL-TN) and High Utility Itemset based on Tail-Node tree (HUI-TN). These two algorithms avoid the time-consuming candidate generation stage and the need of scanning the original dataset multiple times for exact utility values. A novel tree structure, named tail-node tree (TN-tree) is proposed as a key element of our algorithms to maintain complete utililty-information of existing itemsets of a dataset. The performance of HUIL-TN and HUI-TN was evaluated against state-of-the-art reference methods on various datasets. Experimental results showed that our algorithms exceed or close to the best performance on all datasets in terms of running time, while other algorithms can only excel in certain types of dataset. Scalability tests were also performed and our algorithms obtained the flattest curves among all competitors.

摘要

近年来,高实用项目集(HUI)挖掘已成为数据挖掘领域的一个活跃研究课题。在这项研究中,我们提出了两种基于模式增长的高效 HUI 挖掘算法,分别称为基于长度和尾节点树的高实用项目集(HUIL-TN)和基于尾节点树的高实用项目集(HUI-TN)。这两种算法避免了耗时的候选生成阶段,以及多次扫描原始数据集以获取精确效用值的需求。我们提出了一种新的树结构,称为尾节点树(TN-tree),作为我们算法的关键元素,用于维护数据集现有项目集的完整效用信息。HUIL-TN 和 HUI-TN 的性能在各种数据集上与最先进的参考方法进行了评估。实验结果表明,在运行时间方面,我们的算法在所有数据集上都超过或接近最佳性能,而其他算法只能在某些类型的数据集上表现出色。还进行了可扩展性测试,我们的算法在所有竞争对手中获得了最平坦的曲线。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/616a64440657/pone.0248349.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/87d7250e4cc2/pone.0248349.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/a699b8eed116/pone.0248349.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/eed41b3a7ce4/pone.0248349.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/36d325a8988c/pone.0248349.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/55a358cc5a7d/pone.0248349.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/2e59850fef06/pone.0248349.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/23b440127da7/pone.0248349.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/4c3c7905ec7b/pone.0248349.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/c2fad73709b1/pone.0248349.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/616a64440657/pone.0248349.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/87d7250e4cc2/pone.0248349.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/a699b8eed116/pone.0248349.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/eed41b3a7ce4/pone.0248349.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/36d325a8988c/pone.0248349.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/55a358cc5a7d/pone.0248349.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/2e59850fef06/pone.0248349.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/23b440127da7/pone.0248349.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/4c3c7905ec7b/pone.0248349.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/c2fad73709b1/pone.0248349.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7433/7954358/616a64440657/pone.0248349.g010.jpg

相似文献

1
HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth.HUIL-TN 和 HUI-TN:基于模式增长的高实用项集挖掘。
PLoS One. 2021 Mar 12;16(3):e0248349. doi: 10.1371/journal.pone.0248349. eCollection 2021.
2
The Mining Algorithm of Maximum Frequent Itemsets Based on Frequent Pattern Tree.基于频繁模式树的最大频繁项集挖掘算法。
Comput Intell Neurosci. 2022 May 18;2022:7022168. doi: 10.1155/2022/7022168. eCollection 2022.
3
An efficient pattern growth approach for mining fault tolerant frequent itemsets.一种用于挖掘容错频繁项集的高效模式增长方法。
Expert Syst Appl. 2020 Apr 1;143:113046. doi: 10.1016/j.eswa.2019.113046. Epub 2019 Oct 21.
4
Mining Association rules for Low-Frequency itemsets.挖掘低频项集的关联规则。
PLoS One. 2018 Jul 23;13(7):e0198066. doi: 10.1371/journal.pone.0198066. eCollection 2018.
5
A novel association rule mining approach using TID intermediate itemset.一种使用事务标识(TID)中间项集的新型关联规则挖掘方法。
PLoS One. 2018 Jan 19;13(1):e0179703. doi: 10.1371/journal.pone.0179703. eCollection 2018.
6
An efficient algorithm for mining closed itemsets.一种挖掘封闭项集的高效算法。
J Zhejiang Univ Sci. 2004 Jan;5(1):8-15. doi: 10.1007/BF02839306.
7
Incremental high average-utility itemset mining: survey and challenges.增量式高平均效用项集挖掘:综述与挑战
Sci Rep. 2024 Apr 30;14(1):9924. doi: 10.1038/s41598-024-60279-0.
8
Negative and positive association rules mining from text using frequent and infrequent itemsets.基于频繁和非频繁项集从文本中挖掘正负关联规则。
ScientificWorldJournal. 2014;2014:973750. doi: 10.1155/2014/973750. Epub 2014 May 18.
9
Parallel and distributed methods for incremental frequent itemset mining.用于增量频繁项集挖掘的并行和分布式方法。
IEEE Trans Syst Man Cybern B Cybern. 2004 Dec;34(6):2439-50. doi: 10.1109/tsmcb.2004.836887.
10
Diagnosis of coronary artery disease using an efficient hash table based closed frequent itemsets mining.使用基于高效哈希表的封闭频繁项集挖掘技术诊断冠状动脉疾病。
Med Biol Eng Comput. 2018 May;56(5):749-759. doi: 10.1007/s11517-017-1719-6. Epub 2017 Sep 14.

引用本文的文献

1
Multi-level high utility-itemset hiding.多级高实用项集隐藏
PLoS One. 2025 Feb 3;20(2):e0317427. doi: 10.1371/journal.pone.0317427. eCollection 2025.
2
Incremental high average-utility itemset mining: survey and challenges.增量式高平均效用项集挖掘:综述与挑战
Sci Rep. 2024 Apr 30;14(1):9924. doi: 10.1038/s41598-024-60279-0.

本文引用的文献

1
Minimum Free Energy Coding for DNA Storage.最小自由能编码用于 DNA 存储。
IEEE Trans Nanobioscience. 2021 Apr;20(2):212-222. doi: 10.1109/TNB.2021.3056351. Epub 2021 Mar 31.
2
Efficient Algorithm for Mining Non-Redundant High-Utility Association Rules.高效挖掘非冗余高效用关联规则的算法。
Sensors (Basel). 2020 Feb 17;20(4):1078. doi: 10.3390/s20041078.