• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CausNet:基于世代排序的搜索,通过带有父集约束的动态规划寻找最优贝叶斯网络。

CausNet: generational orderings based search for optimal Bayesian networks via dynamic programming with parent set constraints.

机构信息

Division of Biostatistics, Department of Population and Public Health Sciences, University of Southern California, Los Angeles, USA.

出版信息

BMC Bioinformatics. 2023 Feb 14;24(1):46. doi: 10.1186/s12859-023-05159-6.

DOI:10.1186/s12859-023-05159-6
PMID:36788490
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9926787/
Abstract

BACKGROUND

Finding a globally optimal Bayesian Network using exhaustive search is a problem with super-exponential complexity, which severely restricts the number of variables that can feasibly be included. We implement a dynamic programming based algorithm with built-in dimensionality reduction and parent set identification. This reduces the search space substantially and can be applied to large-dimensional data. We use what we call 'generational orderings' based search for optimal networks, which is a novel way to efficiently search the space of possible networks given the possible parent sets. The algorithm supports both continuous and categorical data, as well as continuous, binary and survival outcomes.

RESULTS

We demonstrate the efficacy of our algorithm on both synthetic and real data. In simulations, our algorithm performs better than three state-of-art algorithms that are currently used extensively. We then apply it to an Ovarian Cancer gene expression dataset with 513 genes and a survival outcome. Our algorithm is able to find an optimal network describing the disease pathway consisting of 6 genes leading to the outcome node in just 3.4 min on a personal computer with a 2.3 GHz Intel Core i9 processor with 16 GB RAM.

CONCLUSIONS

Our generational orderings based search for optimal networks is both an efficient and highly scalable approach for finding optimal Bayesian Networks and can be applied to 1000 s of variables. Using specifiable parameters-correlation, FDR cutoffs, and in-degree-one can increase or decrease the number of nodes and density of the networks. Availability of two scoring option-BIC and Bge-and implementation for survival outcomes and mixed data types makes our algorithm very suitable for many types of high dimensional data in a variety of fields.

摘要

背景

使用穷举搜索找到全局最优贝叶斯网络是一个具有超指数复杂度的问题,这严重限制了可以实际包含的变量数量。我们实现了一种基于动态规划的算法,具有内置的降维和父集识别功能。这大大减少了搜索空间,可以应用于高维数据。我们使用我们所谓的“生成顺序”搜索最优网络,这是一种在给定可能的父集的情况下有效地搜索可能网络空间的新方法。该算法支持连续和分类数据,以及连续、二值和生存结局。

结果

我们在合成数据和真实数据上展示了我们算法的有效性。在模拟中,我们的算法比目前广泛使用的三种最先进的算法表现更好。然后,我们将其应用于一个包含 513 个基因和一个生存结局的卵巢癌基因表达数据集。我们的算法能够在个人计算机上仅用 3.4 分钟找到一个描述疾病途径的最优网络,该网络由 6 个基因组成,导致结局节点,计算机的处理器为 2.3GHz Intel Core i9,内存为 16GB。

结论

我们的基于生成顺序的最优网络搜索是一种高效且高度可扩展的方法,可用于寻找最优贝叶斯网络,并且可以应用于数千个变量。使用可指定的参数-相关性、FDR 截止值和入度一,可以增加或减少节点的数量和网络的密度。提供两种评分选项-BIC 和 Bge-以及对生存结局和混合数据类型的实现,使得我们的算法非常适合各种领域的多种高维数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/9a289856ecfd/12859_2023_5159_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/702ac5a9a5f2/12859_2023_5159_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/4a55c6041015/12859_2023_5159_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/fa6517e62360/12859_2023_5159_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/14cf74a34929/12859_2023_5159_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/8f9038824524/12859_2023_5159_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/2097f8879a8f/12859_2023_5159_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/6477c63c4b64/12859_2023_5159_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/3a631b2744d1/12859_2023_5159_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/d55fac92e5e7/12859_2023_5159_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/9a289856ecfd/12859_2023_5159_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/702ac5a9a5f2/12859_2023_5159_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/4a55c6041015/12859_2023_5159_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/fa6517e62360/12859_2023_5159_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/14cf74a34929/12859_2023_5159_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/8f9038824524/12859_2023_5159_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/2097f8879a8f/12859_2023_5159_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/6477c63c4b64/12859_2023_5159_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/3a631b2744d1/12859_2023_5159_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/d55fac92e5e7/12859_2023_5159_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b16/9926787/9a289856ecfd/12859_2023_5159_Fig10_HTML.jpg

相似文献

1
CausNet: generational orderings based search for optimal Bayesian networks via dynamic programming with parent set constraints.CausNet:基于世代排序的搜索,通过带有父集约束的动态规划寻找最优贝叶斯网络。
BMC Bioinformatics. 2023 Feb 14;24(1):46. doi: 10.1186/s12859-023-05159-6.
2
CausNet-partial : 'Partial Generational Orderings' based search for optimal sparse Bayesian networks via dynamic programming with parent set constraints.因果网络局部:基于“部分代序”,通过带父节点集约束的动态规划搜索最优稀疏贝叶斯网络。
Res Sq. 2024 Mar 7:rs.3.rs-4021074. doi: 10.21203/rs.3.rs-4021074/v1.
3
An improved Bayesian network method for reconstructing gene regulatory network based on candidate auto selection.基于候选自动选择的基因调控网络重建的改进贝叶斯网络方法。
BMC Genomics. 2017 Nov 17;18(Suppl 9):844. doi: 10.1186/s12864-017-4228-y.
4
The dynamic programming high-order Dynamic Bayesian Networks learning for identifying effective connectivity in human brain from fMRI.用于从功能磁共振成像中识别人类大脑有效连接性的动态规划高阶动态贝叶斯网络学习。
J Neurosci Methods. 2017 Jun 15;285:33-44. doi: 10.1016/j.jneumeth.2017.05.009. Epub 2017 May 8.
5
Bayesian Orthogonal Least Squares (BOLS) algorithm for reverse engineering of gene regulatory networks.用于基因调控网络逆向工程的贝叶斯正交最小二乘法(BOLS)算法
BMC Bioinformatics. 2007 Jul 13;8:251. doi: 10.1186/1471-2105-8-251.
6
SAGA: a hybrid search algorithm for Bayesian Network structure learning of transcriptional regulatory networks.SAGA:一种用于转录调控网络贝叶斯网络结构学习的混合搜索算法。
J Biomed Inform. 2015 Feb;53:27-35. doi: 10.1016/j.jbi.2014.08.010. Epub 2014 Aug 30.
7
Empirical evaluation of scoring functions for Bayesian network model selection.贝叶斯网络模型选择评分函数的实证评估。
BMC Bioinformatics. 2012;13 Suppl 15(Suppl 15):S14. doi: 10.1186/1471-2105-13-S15-S14. Epub 2012 Sep 11.
8
Learning Moral Graphs in Construction of High-Dimensional Bayesian Networks for Mixed Data.学习构建混合数据高维贝叶斯网络中的道德图。
Neural Comput. 2019 Jun;31(6):1183-1214. doi: 10.1162/neco_a_01190. Epub 2019 Apr 12.
9
3off2: A network reconstruction algorithm based on 2-point and 3-point information statistics.3off2:一种基于两点和三点信息统计的网络重建算法。
BMC Bioinformatics. 2016 Jan 20;17 Suppl 2(Suppl 2):12. doi: 10.1186/s12859-015-0856-x.
10
Kernel-imbedded Gaussian processes for disease classification using microarray gene expression data.使用微阵列基因表达数据的用于疾病分类的核嵌入高斯过程。
BMC Bioinformatics. 2007 Feb 28;8:67. doi: 10.1186/1471-2105-8-67.

引用本文的文献

1
CausNet-partial: 'Partial Generational Orderings' based search for optimal sparse Bayesian networks via dynamic programming with parent set constraints.因果网络局部搜索:基于“部分世代排序”,通过带父节点集约束的动态规划来搜索最优稀疏贝叶斯网络。
PLoS One. 2025 Jun 10;20(6):e0324622. doi: 10.1371/journal.pone.0324622. eCollection 2025.

本文引用的文献

1
fdrci: FDR confidence interval selection and adjustment for large-scale hypothesis testing.FDRCI:大规模假设检验中的FDR置信区间选择与调整
Bioinform Adv. 2022 Jun 13;2(1):vbac047. doi: 10.1093/bioadv/vbac047. eCollection 2022.
2
Prognostic gene expression signature for high-grade serous ovarian cancer.用于高级别浆液性卵巢癌的预后基因表达特征。
Ann Oncol. 2020 Sep;31(9):1240-1250. doi: 10.1016/j.annonc.2020.05.019. Epub 2020 May 28.
3
Applications of Bayesian network models in predicting types of hematological malignancies.
贝叶斯网络模型在预测血液系统恶性肿瘤类型中的应用。
Sci Rep. 2018 May 3;8(1):6951. doi: 10.1038/s41598-018-24758-5.
4
Network Diffusion-Based Prioritization of Autism Risk Genes Identifies Significantly Connected Gene Modules.基于网络扩散的自闭症风险基因优先级排序识别出显著连接的基因模块。
Front Genet. 2017 Sep 25;8:129. doi: 10.3389/fgene.2017.00129. eCollection 2017.
5
A comparison of methods for inferring causal relationships between genotype and phenotype using additional biological measurements.使用额外生物学测量推断基因型与表型之间因果关系的方法比较。
Genet Epidemiol. 2017 Nov;41(7):577-586. doi: 10.1002/gepi.22061. Epub 2017 Jul 10.
6
Network diffusion-based analysis of high-throughput data for the detection of differentially enriched modules.基于网络扩散的高通量数据分析检测差异富集模块。
Sci Rep. 2016 Oct 12;6:34841. doi: 10.1038/srep34841.
7
Bayesian networks in neuroscience: a survey.贝叶斯网络在神经科学中的应用:综述。
Front Comput Neurosci. 2014 Oct 16;8:131. doi: 10.3389/fncom.2014.00131. eCollection 2014.
8
Using Bayesian networks to discover relations between genes, environment, and disease.使用贝叶斯网络发现基因、环境和疾病之间的关系。
BioData Min. 2013 Mar 21;6(1):6. doi: 10.1186/1756-0381-6-6.