New Algorithm and Software (BNOmics) for Inferring and Visualizing Bayesian Networks from Heterogeneous Big Biological and Genetic Data.

Author information

Gogoshin Grigoriy, Boerwinkle Eric, Rodin Andrei S

Affiliations

1 Diabetes and Metabolism Research Institute, City of Hope, Duarte, California.

2 Human Genetics Center, School of Public Health, University of Texas Health Science Center, Houston, Texas.

Publication information

J Comput Biol. 2017 Apr;24(4):340-356. doi: 10.1089/cmb.2016.0100. Epub 2016 Sep 28.

DOI:10.1089/cmb.2016.0100

PMID:27681505

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC5372779/

Abstract

Bayesian network (BN) reconstruction is a prototypical systems biology data analysis approach that has been successfully used to reverse engineer and model networks reflecting different layers of biological organization (ranging from genetic to epigenetic to cellular pathway to metabolomic). It is especially relevant in the context of modern (ongoing and prospective) studies that generate heterogeneous high-throughput omics datasets. However, there are both theoretical and practical obstacles to the seamless application of BN modeling to such big data, including the computational inefficiency of optimal BN structure search algorithms, ambiguity in data discretization, mixing of data types, imputation and validation, and, in general, limited scalability in both reconstruction and visualization of BNs. To overcome these and other obstacles, we present BNOmics, an improved algorithm and software toolkit for inferring and analyzing BNs from omics datasets. BNOmics aims at comprehensive systems biology-type data exploration, including both generating new biological hypotheses and testing and validating existing ones. Novel aspects of the algorithm center around increasing scalability and applicability to varying data types (with different explicit and implicit distributional assumptions) within the same analysis framework. An output and visualization interface to widely available graph-rendering software is also included. Three diverse applications are detailed. BNOmics was originally developed in the context of genetic epidemiology data and is being continuously optimized to keep pace with the ever-increasing inflow of available large-scale omics datasets. As such, software scalability and usability on less-than-exotic computer hardware are a priority, as is the applicability of the algorithm and software to heterogeneous datasets containing many data types: single-nucleotide polymorphisms and other genetic/epigenetic/transcriptome variables, metabolite levels, epidemiological variables, endpoints, phenotypes, etc.
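The abstract mentions an output interface to widely available graph-rendering software. As a rough illustration of what such an export step can look like, the sketch below serializes a small directed acyclic graph into Graphviz DOT text. This is not BNOmics code: the function name `to_dot`, the child-to-parents dictionary layout, and the variable names in the example network are all assumptions made for illustration.

```python
from typing import Dict, List


def to_dot(parents: Dict[str, List[str]], name: str = "bn") -> str:
    """Serialize a DAG, given as child -> list of parents, into DOT text.

    Hypothetical helper for illustration only; not part of BNOmics.
    """
    lines = [f"digraph {name} {{"]
    # Collect every node, including parents that have no parents themselves.
    nodes = sorted(set(parents) | {p for ps in parents.values() for p in ps})
    for node in nodes:
        lines.append(f'  "{node}";')
    # Emit one directed edge per parent -> child relation.
    for child in sorted(parents):
        for parent in sorted(parents[child]):
            lines.append(f'  "{parent}" -> "{child}";')
    lines.append("}")
    return "\n".join(lines)


# Made-up example network mixing a genetic variant, a covariate,
# a metabolite level, and an endpoint (names are illustrative).
example = {"LDL": ["SNP_rs1", "BMI"], "CHD": ["LDL"]}
dot_text = to_dot(example)
print(dot_text)
```

The resulting text can be saved to a file and passed to a DOT-compatible renderer. Keeping the structure as a plain child-to-parents mapping is only one convenient choice; the representation used by the actual software is not described in the abstract.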
