• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HUOPM:高效用占用模式挖掘。

HUOPM: High-Utility Occupancy Pattern Mining.

出版信息

IEEE Trans Cybern. 2020 Mar;50(3):1195-1208. doi: 10.1109/TCYB.2019.2896267. Epub 2019 Feb 20.

DOI:10.1109/TCYB.2019.2896267
PMID:30794524
Abstract

Mining useful patterns from varied types of databases is an important research topic, which has many real-life applications. Most studies have considered the frequency as sole interestingness measure to identify high-quality patterns. However, each object is different in nature. The relative importance of objects is not equal, in terms of criteria, such as the utility, risk, or interest. Besides, another limitation of frequent patterns is that they generally have a low occupancy, that is, they often represent small sets of items in transactions containing many items and, thus, may not be truly representative of these transactions. To extract high-quality patterns in real-life applications, this paper extends the occupancy measure to also assess the utility of patterns in transaction databases. We propose an efficient algorithm named high-utility occupancy pattern mining (HUOPM). It considers user preferences in terms of frequency, utility, and occupancy. A novel frequency-utility tree and two compact data structures, called the utility-occupancy list and frequency-utility table, are designed to provide global and partial downward closure properties for pruning the search space. The proposed method can efficiently discover the complete set of high-quality patterns without candidate generation. Extensive experiments have been conducted on several datasets to evaluate the effectiveness and efficiency of the proposed algorithm. Results show that the derived patterns are intelligible, reasonable, and acceptable, and that HUOPM with its pruning strategies outperforms the state-of-the-art algorithm, in terms of runtime and search space, respectively.

摘要

从各种类型的数据库中挖掘有用的模式是一个重要的研究课题,它有许多实际应用。大多数研究都考虑将频率作为唯一的有趣性度量来识别高质量的模式。然而,每个对象的性质都不同。在某些标准(如效用、风险或兴趣)方面,对象的相对重要性是不平等的。此外,频繁模式的另一个局限性是它们通常具有低占用率,也就是说,它们通常代表交易中许多项目中的一小部分项目,因此可能无法真正代表这些交易。为了在实际应用中提取高质量的模式,本文将占用率度量扩展到交易数据库中模式的效用评估。我们提出了一种名为高效用占用模式挖掘(HUOPM)的高效算法。它考虑了用户在频率、效用和占用率方面的偏好。设计了一种新颖的频率-效用树和两个紧凑的数据结构,称为效用-占用列表和频率-效用表,以提供全局和部分向下闭包属性,以修剪搜索空间。该方法可以在不生成候选的情况下有效地发现完整的高质量模式集。在几个数据集上进行了广泛的实验,以评估所提出算法的有效性和效率。结果表明,所得到的模式是可理解的、合理的和可接受的,并且所提出的算法及其修剪策略在运行时和搜索空间方面均优于最先进的算法。

相似文献

1
HUOPM: High-Utility Occupancy Pattern Mining.HUOPM:高效用占用模式挖掘。
IEEE Trans Cybern. 2020 Mar;50(3):1195-1208. doi: 10.1109/TCYB.2019.2896267. Epub 2019 Feb 20.
2
Fast Utility Mining on Sequence Data.快速序列数据实用挖掘。
IEEE Trans Cybern. 2021 Feb;51(2):487-500. doi: 10.1109/TCYB.2020.2970176. Epub 2021 Jan 15.
3
An incremental high-utility mining algorithm with transaction insertion.一种带有事务插入的增量式高实用性挖掘算法。
ScientificWorldJournal. 2015;2015:161564. doi: 10.1155/2015/161564. Epub 2015 Feb 25.
4
Mining high occupancy patterns to analyze incremental data in intelligent systems.
ISA Trans. 2022 Dec;131:460-475. doi: 10.1016/j.isatra.2022.05.003. Epub 2022 May 10.
5
Mining actionable combined high utility incremental and associated sequential patterns.挖掘可操作的组合高效用增量和关联序列模式。
PLoS One. 2023 Mar 29;18(3):e0283365. doi: 10.1371/journal.pone.0283365. eCollection 2023.
6
Mining of high utility-probability sequential patterns from uncertain databases.从不确定数据库中挖掘高效用概率序列模式。
PLoS One. 2017 Jul 25;12(7):e0180931. doi: 10.1371/journal.pone.0180931. eCollection 2017.
7
Apriori Versions Based on MapReduce for Mining Frequent Patterns on Big Data.基于 MapReduce 的频繁模式挖掘的先验版本。
IEEE Trans Cybern. 2018 Oct;48(10):2851-2865. doi: 10.1109/TCYB.2017.2751081. Epub 2017 Sep 27.
8
Scalable and Efficient Approach for High Temporal Fuzzy Utility Pattern Mining.用于高时间模糊效用模式挖掘的可扩展高效方法
IEEE Trans Cybern. 2023 Dec;53(12):7672-7685. doi: 10.1109/TCYB.2022.3198661. Epub 2023 Nov 29.
9
Efficient discovery of risk patterns in medical data.医学数据中风险模式的高效发现。
Artif Intell Med. 2009 Jan;45(1):77-89. doi: 10.1016/j.artmed.2008.07.008. Epub 2008 Sep 9.
10
Differentially Private Frequent Sequence Mining via Sampling-based Candidate Pruning.基于采样候选剪枝的差分隐私频繁序列挖掘
Proc Int Conf Data Eng. 2015 Apr;2015:1035-1046. doi: 10.1109/ICDE.2015.7113354.

引用本文的文献

1
NetNMSP: Nonoverlapping maximal sequential pattern mining.NetNMSP:非重叠最大顺序模式挖掘。
Appl Intell (Dordr). 2022;52(9):9861-9884. doi: 10.1007/s10489-021-02912-3. Epub 2022 Jan 10.
2
Efficient Algorithm for Mining Non-Redundant High-Utility Association Rules.高效挖掘非冗余高效用关联规则的算法。
Sensors (Basel). 2020 Feb 17;20(4):1078. doi: 10.3390/s20041078.