• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于贝叶斯网络数据密集型学习的并行增量方法。

A Parallel and Incremental Approach for Data-Intensive Learning of Bayesian Networks.

出版信息

IEEE Trans Cybern. 2015 Dec;45(12):2890-904. doi: 10.1109/TCYB.2015.2388791. Epub 2015 Jan 22.

DOI:10.1109/TCYB.2015.2388791
PMID:25622335
Abstract

Bayesian network (BN) has been adopted as the underlying model for representing and inferring uncertain knowledge. As the basis of realistic applications centered on probabilistic inferences, learning a BN from data is a critical subject of machine learning, artificial intelligence, and big data paradigms. Currently, it is necessary to extend the classical methods for learning BNs with respect to data-intensive computing or in cloud environments. In this paper, we propose a parallel and incremental approach for data-intensive learning of BNs from massive, distributed, and dynamically changing data by extending the classical scoring and search algorithm and using MapReduce. First, we adopt the minimum description length as the scoring metric and give the two-pass MapReduce-based algorithms for computing the required marginal probabilities and scoring the candidate graphical model from sample data. Then, we give the corresponding strategy for extending the classical hill-climbing algorithm to obtain the optimal structure, as well as that for storing a BN by <key, value> pairs. Further, in view of the dynamic characteristics of the changing data, we give the concept of influence degree to measure the coincidence of the current BN with new data, and then propose the corresponding two-pass MapReduce-based algorithms for BNs incremental learning. Experimental results show the efficiency, scalability, and effectiveness of our methods.

摘要

贝叶斯网络(BN)已被用作表示和推断不确定知识的基础模型。作为以概率推理为中心的实际应用的基础,从数据中学习 BN 是机器学习、人工智能和大数据范例的关键课题。目前,有必要扩展用于数据密集型计算或云环境中的经典 BN 学习方法。在本文中,我们通过扩展经典的评分和搜索算法并使用 MapReduce 来提出一种用于从大规模、分布式和动态变化的数据中进行数据密集型 BN 学习的并行和增量方法。首先,我们采用最小描述长度作为评分指标,并给出基于两阶段 MapReduce 的算法,用于从样本数据计算所需的边缘概率和对候选图形模型进行评分。然后,我们给出了将经典爬山算法扩展以获得最优结构的相应策略,以及通过<键,值>对存储 BN 的策略。此外,针对数据变化的动态特性,我们给出了影响度的概念来衡量当前 BN 与新数据的一致性,然后提出了相应的基于两阶段 MapReduce 的 BN 增量学习算法。实验结果表明了我们方法的效率、可扩展性和有效性。

相似文献

1
A Parallel and Incremental Approach for Data-Intensive Learning of Bayesian Networks.一种用于贝叶斯网络数据密集型学习的并行增量方法。
IEEE Trans Cybern. 2015 Dec;45(12):2890-904. doi: 10.1109/TCYB.2015.2388791. Epub 2015 Jan 22.
2
A novel algorithm for scalable and accurate Bayesian network learning.一种用于可扩展且准确的贝叶斯网络学习的新算法。
Stud Health Technol Inform. 2004;107(Pt 1):711-5.
3
MapReduce Based Parallel Neural Networks in Enabling Large Scale Machine Learning.基于MapReduce的并行神经网络助力大规模机器学习。
Comput Intell Neurosci. 2015;2015:297672. doi: 10.1155/2015/297672. Epub 2015 Nov 22.
4
A hybrid Bayesian network learning method for constructing gene networks.一种用于构建基因网络的混合贝叶斯网络学习方法。
Comput Biol Chem. 2007 Oct;31(5-6):361-72. doi: 10.1016/j.compbiolchem.2007.08.005. Epub 2007 Aug 19.
5
Incorporating expert knowledge when learning Bayesian network structure: a medical case study.在学习贝叶斯网络结构时纳入专家知识:一个医学案例研究。
Artif Intell Med. 2011 Nov;53(3):181-204. doi: 10.1016/j.artmed.2011.08.004. Epub 2011 Sep 29.
6
Developing Bayesian networks from a dependency-layered ontology: A proof-of-concept in radiation oncology.从依赖分层本体中开发贝叶斯网络:放射肿瘤学中的概念验证。
Med Phys. 2017 Aug;44(8):4350-4359. doi: 10.1002/mp.12340. Epub 2017 Jun 30.
7
A Novel BN Learning Algorithm Based on Block Learning Strategy.基于分块学习策略的新型贝叶斯网络学习算法。
Sensors (Basel). 2020 Nov 7;20(21):6357. doi: 10.3390/s20216357.
8
Mutual information preconditioning improves structure learning of Bayesian networks from medical databases.互信息预处理可改善从医学数据库中学习贝叶斯网络的结构。
IEEE Trans Inf Technol Biomed. 2009 Nov;13(6):984-9. doi: 10.1109/TITB.2009.2026273. Epub 2009 Jul 28.
9
Biomedical knowledge discovery with topological constraints modeling in Bayesian networks: a preliminary report.贝叶斯网络中基于拓扑约束建模的生物医学知识发现:初步报告
Stud Health Technol Inform. 2007;129(Pt 1):560-5.
10
Growing Bayesian network models of gene networks from seed genes.从种子基因构建基因网络的贝叶斯网络增长模型。
Bioinformatics. 2005 Sep 1;21 Suppl 2:ii224-9. doi: 10.1093/bioinformatics/bti1137.

引用本文的文献

1
Machine Learning in Unmanned Systems for Chemical Synthesis.无人系统中的机器学习在化学合成中的应用。
Molecules. 2023 Feb 27;28(5):2232. doi: 10.3390/molecules28052232.
2
Differential Diagnostic Reasoning Method for Benign Paroxysmal Positional Vertigo Based on Dynamic Uncertain Causality Graph.基于动态不确定因果图的良性阵发性位置性眩晕鉴别诊断推理方法。
Comput Math Methods Med. 2020 Jan 24;2020:1541989. doi: 10.1155/2020/1541989. eCollection 2020.
3
PEnBayes: A Multi-Layered Ensemble Approach for Learning Bayesian Network Structure from Big Data.
PenBayes:一种从大数据中学习贝叶斯网络结构的多层集成方法。
Sensors (Basel). 2019 Oct 11;19(20):4400. doi: 10.3390/s19204400.