• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于序列模式挖掘的节点链接方法。

A node linkage approach for sequential pattern mining.

作者信息

Navarro Osvaldo, Cumplido René, Villaseñor-Pineda Luis, Feregrino-Uribe Claudia, Carrasco-Ochoa Jesús Ariel

机构信息

Departamento de Ciencias Computacionales, Instituto Nacional de Astrofísica, Óptica y Electrónica, Sta. Ma. Tonantzintla, Puebla, México.

出版信息

PLoS One. 2014 Jun 16;9(6):e95418. doi: 10.1371/journal.pone.0095418. eCollection 2014.

DOI:10.1371/journal.pone.0095418
PMID:24933123
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4059635/
Abstract

Sequential Pattern Mining is a widely addressed problem in data mining, with applications such as analyzing Web usage, examining purchase behavior, and text mining, among others. Nevertheless, with the dramatic increase in data volume, the current approaches prove inefficient when dealing with large input datasets, a large number of different symbols and low minimum supports. In this paper, we propose a new sequential pattern mining algorithm, which follows a pattern-growth scheme to discover sequential patterns. Unlike most pattern growth algorithms, our approach does not build a data structure to represent the input dataset, but instead accesses the required sequences through pseudo-projection databases, achieving better runtime and reducing memory requirements. Our algorithm traverses the search space in a depth-first fashion and only preserves in memory a pattern node linkage and the pseudo-projections required for the branch being explored at the time. Experimental results show that our new approach, the Node Linkage Depth-First Traversal algorithm (NLDFT), has better performance and scalability in comparison with state of the art algorithms.

摘要

序列模式挖掘是数据挖掘中一个被广泛研究的问题,其应用包括分析网页使用情况、研究购买行为以及文本挖掘等。然而,随着数据量的急剧增加,当前的方法在处理大型输入数据集、大量不同符号和低最小支持度时效率低下。在本文中,我们提出了一种新的序列模式挖掘算法,该算法遵循模式增长方案来发现序列模式。与大多数模式增长算法不同,我们的方法不构建数据结构来表示输入数据集,而是通过伪投影数据库访问所需序列,从而实现更好的运行时性能并降低内存需求。我们的算法以深度优先的方式遍历搜索空间,并且仅在内存中保留模式节点链接和当时正在探索的分支所需的伪投影。实验结果表明,我们的新方法,即节点链接深度优先遍历算法(NLDFT),与现有算法相比具有更好的性能和可扩展性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/8fcee0477b4c/pone.0095418.g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/0cb203df160a/pone.0095418.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/dd7a25f06a9a/pone.0095418.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/3d8d9bc89e08/pone.0095418.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/46834007d453/pone.0095418.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/ef5737cd04e4/pone.0095418.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/5051ef5a7d9f/pone.0095418.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/308c0c5ff479/pone.0095418.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/b87c2607843b/pone.0095418.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/ebf4218193fd/pone.0095418.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/332dc048f3ab/pone.0095418.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/fc3915046d1f/pone.0095418.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/518286a224a0/pone.0095418.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/f17bb6ae7552/pone.0095418.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/3afbea96b091/pone.0095418.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/1803035e5ba3/pone.0095418.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/22a26ec4c23d/pone.0095418.g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/cdf985c1bb48/pone.0095418.g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/336848382b7f/pone.0095418.g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/1fb9fcb91fdb/pone.0095418.g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/8fcee0477b4c/pone.0095418.g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/0cb203df160a/pone.0095418.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/dd7a25f06a9a/pone.0095418.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/3d8d9bc89e08/pone.0095418.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/46834007d453/pone.0095418.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/ef5737cd04e4/pone.0095418.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/5051ef5a7d9f/pone.0095418.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/308c0c5ff479/pone.0095418.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/b87c2607843b/pone.0095418.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/ebf4218193fd/pone.0095418.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/332dc048f3ab/pone.0095418.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/fc3915046d1f/pone.0095418.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/518286a224a0/pone.0095418.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/f17bb6ae7552/pone.0095418.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/3afbea96b091/pone.0095418.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/1803035e5ba3/pone.0095418.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/22a26ec4c23d/pone.0095418.g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/cdf985c1bb48/pone.0095418.g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/336848382b7f/pone.0095418.g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/1fb9fcb91fdb/pone.0095418.g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a77/4059635/8fcee0477b4c/pone.0095418.g020.jpg

相似文献

1
A node linkage approach for sequential pattern mining.一种用于序列模式挖掘的节点链接方法。
PLoS One. 2014 Jun 16;9(6):e95418. doi: 10.1371/journal.pone.0095418. eCollection 2014.
2
Mining of high utility-probability sequential patterns from uncertain databases.从不确定数据库中挖掘高效用概率序列模式。
PLoS One. 2017 Jul 25;12(7):e0180931. doi: 10.1371/journal.pone.0180931. eCollection 2017.
3
Mining Contiguous Sequential Generators in Biological Sequences.挖掘生物序列中的连续序列生成器
IEEE/ACM Trans Comput Biol Bioinform. 2016 Sep-Oct;13(5):855-867. doi: 10.1109/TCBB.2015.2495132. Epub 2015 Oct 26.
4
Efficient mining gapped sequential patterns for motifs in biological sequences.用于挖掘生物序列中基序的带间隙序列模式的高效挖掘方法。
BMC Syst Biol. 2013;7 Suppl 4(Suppl 4):S7. doi: 10.1186/1752-0509-7-S4-S7. Epub 2013 Oct 23.
5
Mining high occupancy patterns to analyze incremental data in intelligent systems.
ISA Trans. 2022 Dec;131:460-475. doi: 10.1016/j.isatra.2022.05.003. Epub 2022 May 10.
6
Discovering metric temporal constraint networks on temporal databases.发现时态数据库上的度量时态约束网络。
Artif Intell Med. 2013 Jul;58(3):139-54. doi: 10.1016/j.artmed.2013.03.006. Epub 2013 May 6.
7
Mining actionable combined high utility incremental and associated sequential patterns.挖掘可操作的组合高效用增量和关联序列模式。
PLoS One. 2023 Mar 29;18(3):e0283365. doi: 10.1371/journal.pone.0283365. eCollection 2023.
8
Frequent patterns mining in multiple biological sequences.多生物序列中的频繁模式挖掘。
Comput Biol Med. 2013 Oct;43(10):1444-52. doi: 10.1016/j.compbiomed.2013.07.009. Epub 2013 Jul 27.
9
MpBsmi: A new algorithm for the recognition of continuous biological sequence pattern based on index structure.MpBsmi:一种基于索引结构的连续生物序列模式识别新算法。
PLoS One. 2018 Apr 23;13(4):e0195601. doi: 10.1371/journal.pone.0195601. eCollection 2018.
10
Analysing large biological data sets with an improved algorithm for MIC.使用一种改进的最小抑菌浓度(MIC)算法分析大型生物数据集。
Int J Data Min Bioinform. 2015;13(2):158-70. doi: 10.1504/ijdmb.2015.071548.