CGBayesNets: conditional Gaussian Bayesian network learning and inference with mixed discrete and continuous data.

Affiliations

Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, Massachusetts, United States of America; Harvard Medical School, Boston, Massachusetts, United States of America.

Harvard Medical School, Boston, Massachusetts, United States of America; Children's Hospital Informatics Program, Children's Hospital Boston, Boston, Massachusetts, United States of America.

Publication information

PLoS Comput Biol. 2014 Jun 12;10(6):e1003676. doi: 10.1371/journal.pcbi.1003676. eCollection 2014 Jun.

DOI:10.1371/journal.pcbi.1003676
PMID:24922310
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4055564/
Abstract

Bayesian Networks (BN) have been a popular predictive modeling formalism in bioinformatics, but their application in modern genomics has been slowed by an inability to cleanly handle domains with mixed discrete and continuous variables. Existing free BN software packages either discretize continuous variables, which can lead to information loss, or do not include inference routines, which makes prediction with the BN impossible. We present CGBayesNets, a BN package focused around prediction of a clinical phenotype from mixed discrete and continuous variables, which fills these gaps. CGBayesNets implements Bayesian likelihood and inference algorithms for the conditional Gaussian Bayesian network (CGBNs) formalism, one appropriate for predicting an outcome of interest from, e.g., multimodal genomic data. We provide four different network learning algorithms, each making a different tradeoff between computational cost and network likelihood. CGBayesNets provides a full suite of functions for model exploration and verification, including cross validation, bootstrapping, and AUC manipulation. We highlight several results obtained previously with CGBayesNets, including predictive models of wood properties from tree genomics, leukemia subtype classification from mixed genomic data, and robust prediction of intensive care unit mortality outcomes from metabolomic profiles. We also provide detailed example analysis on public metabolomic and gene expression datasets. CGBayesNets is implemented in MATLAB and available as MATLAB source code, under an Open Source license and anonymous download at http://www.cgbayesnets.com.
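
As a rough illustration of the conditional Gaussian idea described in the abstract, the sketch below models one continuous variable whose mean and standard deviation depend on a discrete phenotype, then recovers the phenotype posterior with Bayes' rule. It is written in Python rather than MATLAB and does not use or reproduce the CGBayesNets API; the function names (`fit_cg_node`, `posterior`) and the toy data are assumptions made up for this example.

```python
import math

def gaussian_pdf(x, mean, std):
    """Density of a univariate normal distribution at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))

def fit_cg_node(xs, labels):
    """Estimate P(label) and one Gaussian per label for a continuous variable.

    Hypothetical helper: returns {label: (prior, mean, std)} using
    maximum-likelihood estimates.
    """
    params = {}
    n = len(xs)
    for label in sorted(set(labels)):
        vals = [x for x, l in zip(xs, labels) if l == label]
        mean = sum(vals) / len(vals)
        var = sum((v - mean) ** 2 for v in vals) / len(vals)
        params[label] = (len(vals) / n, mean, math.sqrt(var))
    return params

def posterior(params, x):
    """P(label | x) via Bayes' rule over the discrete parent."""
    joint = {label: prior * gaussian_pdf(x, mean, std)
             for label, (prior, mean, std) in params.items()}
    total = sum(joint.values())
    return {label: p / total for label, p in joint.items()}

# Toy, made-up measurements: one continuous feature, binary phenotype.
xs = [1.0, 1.2, 0.8, 3.0, 3.2, 2.8]
labels = [0, 0, 0, 1, 1, 1]

params = fit_cg_node(xs, labels)
post = posterior(params, 1.1)
print(post)
```

A full CGBN would chain many such nodes, allow continuous parents through linear regression terms, and learn the network structure; this sketch covers only the single-node inference step.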
