• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝叶斯网络模型中基因表达数据的先验生物学知识的定量利用。

Quantitative utilization of prior biological knowledge in the Bayesian network modeling of gene expression data.

机构信息

Department of Physics, University of Alabama, Birmingham, AL 35294, USA.

出版信息

BMC Bioinformatics. 2011 Aug 31;12:359. doi: 10.1186/1471-2105-12-359.

DOI:10.1186/1471-2105-12-359
PMID:21884587
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3203352/
Abstract

BACKGROUND

Bayesian Network (BN) is a powerful approach to reconstructing genetic regulatory networks from gene expression data. However, expression data by itself suffers from high noise and lack of power. Incorporating prior biological knowledge can improve the performance. As each type of prior knowledge on its own may be incomplete or limited by quality issues, integrating multiple sources of prior knowledge to utilize their consensus is desirable.

RESULTS

We introduce a new method to incorporate the quantitative information from multiple sources of prior knowledge. It first uses the Naïve Bayesian classifier to assess the likelihood of functional linkage between gene pairs based on prior knowledge. In this study we included cocitation in PubMed and schematic similarity in Gene Ontology annotation. A candidate network edge reservoir is then created in which the copy number of each edge is proportional to the estimated likelihood of linkage between the two corresponding genes. In network simulation the Markov Chain Monte Carlo sampling algorithm is adopted, and samples from this reservoir at each iteration to generate new candidate networks. We evaluated the new algorithm using both simulated and real gene expression data including that from a yeast cell cycle and a mouse pancreas development/growth study. Incorporating prior knowledge led to a ~2 fold increase in the number of known transcription regulations recovered, without significant change in false positive rate. In contrast, without the prior knowledge BN modeling is not always better than a random selection, demonstrating the necessity in network modeling to supplement the gene expression data with additional information.

CONCLUSION

our new development provides a statistical means to utilize the quantitative information in prior biological knowledge in the BN modeling of gene expression data, which significantly improves the performance.

摘要

背景

贝叶斯网络(BN)是一种从基因表达数据中重建遗传调控网络的强大方法。然而,表达数据本身存在高噪声和缺乏信息的问题。结合先验生物学知识可以提高性能。由于每种先验知识本身可能不完整或受到质量问题的限制,因此整合多种来源的先验知识以利用它们的共识是可取的。

结果

我们介绍了一种新的方法来整合来自多种来源的先验知识的定量信息。它首先使用朴素贝叶斯分类器根据先验知识评估基因对之间功能关联的可能性。在本研究中,我们包括 PubMed 中的共引和 Gene Ontology 注释中的示意图相似性。然后创建一个候选网络边缘库,其中每个边缘的副本数与两个对应基因之间链接的估计可能性成正比。在网络模拟中,采用马尔可夫链蒙特卡罗抽样算法,从该库中在每次迭代时采样以生成新的候选网络。我们使用模拟和真实基因表达数据(包括酵母细胞周期和小鼠胰腺发育/生长研究的数据)评估了新算法。结合先验知识可将已知转录调控的数量增加约 2 倍,而假阳性率没有显著变化。相比之下,如果没有先验知识,BN 建模并不总是优于随机选择,这表明在网络建模中需要用额外的信息来补充基因表达数据。

结论

我们的新方法为利用 BN 对基因表达数据进行建模时的先验生物学知识中的定量信息提供了一种统计手段,显著提高了性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/c43a689c7d23/1471-2105-12-359-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/a890a8d2553e/1471-2105-12-359-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/b870d08cf26e/1471-2105-12-359-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/4cd91c5de85e/1471-2105-12-359-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/81687aedf2a5/1471-2105-12-359-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/6a21bf7c0a1f/1471-2105-12-359-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/eb85a79ae00e/1471-2105-12-359-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/c43a689c7d23/1471-2105-12-359-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/a890a8d2553e/1471-2105-12-359-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/b870d08cf26e/1471-2105-12-359-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/4cd91c5de85e/1471-2105-12-359-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/81687aedf2a5/1471-2105-12-359-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/6a21bf7c0a1f/1471-2105-12-359-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/eb85a79ae00e/1471-2105-12-359-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/412c/3203352/c43a689c7d23/1471-2105-12-359-7.jpg

相似文献

1
Quantitative utilization of prior biological knowledge in the Bayesian network modeling of gene expression data.贝叶斯网络模型中基因表达数据的先验生物学知识的定量利用。
BMC Bioinformatics. 2011 Aug 31;12:359. doi: 10.1186/1471-2105-12-359.
2
Reverse engineering module networks by PSO-RNN hybrid modeling.通过粒子群优化-递归神经网络混合建模对模块网络进行逆向工程。
BMC Genomics. 2009 Jul 7;10 Suppl 1(Suppl 1):S15. doi: 10.1186/1471-2164-10-S1-S15.
3
Bayesian Orthogonal Least Squares (BOLS) algorithm for reverse engineering of gene regulatory networks.用于基因调控网络逆向工程的贝叶斯正交最小二乘法(BOLS)算法
BMC Bioinformatics. 2007 Jul 13;8:251. doi: 10.1186/1471-2105-8-251.
4
Global protein function annotation through mining genome-scale data in yeast Saccharomyces cerevisiae.通过挖掘酿酒酵母基因组规模数据进行全球蛋白质功能注释。
Nucleic Acids Res. 2004 Dec 7;32(21):6414-24. doi: 10.1093/nar/gkh978. Print 2004.
5
A copula method for modeling directional dependence of genes.一种用于建模基因方向依赖性的共现方法。
BMC Bioinformatics. 2008 May 1;9:225. doi: 10.1186/1471-2105-9-225.
6
Improvements in the reconstruction of time-varying gene regulatory networks: dynamic programming and regularization by information sharing among genes.时变基因调控网络重建的改进:通过基因间信息共享的动态规划和正则化。
Bioinformatics. 2011 Mar 1;27(5):693-9. doi: 10.1093/bioinformatics/btq711. Epub 2010 Dec 21.
7
Modular analysis of the probabilistic genetic interaction network.概率遗传交互网络的模块化分析。
Bioinformatics. 2011 Mar 15;27(6):853-9. doi: 10.1093/bioinformatics/btr031. Epub 2011 Jan 28.
8
Gene regulatory network inference based on a nonhomogeneous dynamic Bayesian network model with an improved Markov Monte Carlo sampling.基于改进的马尔可夫蒙特卡罗抽样的非齐次动态贝叶斯网络模型的基因调控网络推断。
BMC Bioinformatics. 2023 Jun 24;24(1):264. doi: 10.1186/s12859-023-05381-2.
9
An improved Bayesian network method for reconstructing gene regulatory network based on candidate auto selection.基于候选自动选择的基因调控网络重建的改进贝叶斯网络方法。
BMC Genomics. 2017 Nov 17;18(Suppl 9):844. doi: 10.1186/s12864-017-4228-y.
10
Using Bayesian networks to analyze expression data.使用贝叶斯网络分析表达数据。
J Comput Biol. 2000;7(3-4):601-20. doi: 10.1089/106652700750050961.

引用本文的文献

1
Correlation Imputation for Single-Cell RNA-seq.单细胞 RNA-seq 的关联插补。
J Comput Biol. 2022 May;29(5):465-482. doi: 10.1089/cmb.2021.0403. Epub 2022 Mar 21.
2
Correlation Imputation in Single cell RNA-seq using Auxiliary Information and Ensemble Learning.利用辅助信息和集成学习进行单细胞RNA测序中的相关性插补
ACM BCB. 2020 Sep;2020. doi: 10.1145/3388440.3412462.
3
Comprehensive network modeling from single cell RNA sequencing of human and mouse reveals well conserved transcription regulation of hematopoiesis.

本文引用的文献

1
Global analysis of phase locking in gene expression during cell cycle: the potential in network modeling.细胞周期中基因表达锁相的全局分析:网络建模中的潜力。
BMC Syst Biol. 2010 Dec 3;4:167. doi: 10.1186/1752-0509-4-167.
2
Characterizing dynamic changes in the human blood transcriptional network.描述人类血液转录网络的动态变化。
PLoS Comput Biol. 2010 Feb 12;6(2):e1000671. doi: 10.1371/journal.pcbi.1000671.
3
Literature-based priors for gene regulatory networks.基于文献的基因调控网络先验知识。
通过对人类和小鼠的单细胞RNA测序进行综合网络建模,揭示了造血过程中高度保守的转录调控。
BMC Genomics. 2020 Dec 29;21(Suppl 11):849. doi: 10.1186/s12864-020-07241-2.
4
Applications of Bayesian network models in predicting types of hematological malignancies.贝叶斯网络模型在预测血液系统恶性肿瘤类型中的应用。
Sci Rep. 2018 May 3;8(1):6951. doi: 10.1038/s41598-018-24758-5.
5
Differential Regulatory Analysis Based on Coexpression Network in Cancer Research.癌症研究中基于共表达网络的差异调控分析
Biomed Res Int. 2016;2016:4241293. doi: 10.1155/2016/4241293. Epub 2016 Aug 11.
6
Bayesian modeling suggests that IL-12 (p40), IL-13 and MCP-1 drive murine cytokine networks in vivo.贝叶斯模型表明,白细胞介素-12(p40)、白细胞介素-13和单核细胞趋化蛋白-1在体内驱动小鼠细胞因子网络。
BMC Syst Biol. 2015 Nov 9;9:76. doi: 10.1186/s12918-015-0226-3.
7
Identifying pathogenic processes by integrating microarray data with prior knowledge.通过将微阵列数据与先验知识相结合来识别致病过程。
BMC Bioinformatics. 2014 Apr 24;15:115. doi: 10.1186/1471-2105-15-115.
8
Inference and validation of predictive gene networks from biomedical literature and gene expression data.基于生物医学文献和基因表达数据的预测性基因网络的推断与验证。
Genomics. 2014 May-Jun;103(5-6):329-36. doi: 10.1016/j.ygeno.2014.03.004. Epub 2014 Mar 29.
9
Decision tree-based method for integrating gene expression, demographic, and clinical data to determine disease endotypes.基于决策树的方法,用于整合基因表达、人口统计学和临床数据以确定疾病内型。
BMC Syst Biol. 2013 Nov 4;7:119. doi: 10.1186/1752-0509-7-119.
10
Boosting probabilistic graphical model inference by incorporating prior knowledge from multiple sources.通过整合来自多个来源的先验知识来提高概率图模型推断能力。
PLoS One. 2013 Jun 24;8(6):e67410. doi: 10.1371/journal.pone.0067410. Print 2013.
Bioinformatics. 2009 Jul 15;25(14):1768-74. doi: 10.1093/bioinformatics/btp277. Epub 2009 Apr 23.
4
Diagnosis and clinical management of spinal muscular atrophy.脊髓性肌萎缩症的诊断与临床管理
Phys Med Rehabil Clin N Am. 2008 Aug;19(3):661-80, xii. doi: 10.1016/j.pmr.2008.02.004.
5
Seeded Bayesian Networks: constructing genetic networks from microarray data.种子贝叶斯网络:从微阵列数据构建遗传网络。
BMC Syst Biol. 2008 Jul 4;2:57. doi: 10.1186/1752-0509-2-57.
6
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks with Bayesian networks.利用贝叶斯网络将生物学先验知识贝叶斯整合到基因调控网络的重建中。
Comput Syst Bioinformatics Conf. 2007;6:85-95.
7
A framework for elucidating regulatory networks based on prior information and expression data.一种基于先验信息和表达数据阐明调控网络的框架。
Ann N Y Acad Sci. 2007 Dec;1115:240-8. doi: 10.1196/annals.1407.002. Epub 2007 Oct 9.
8
An improved, bias-reduced probabilistic functional gene network of baker's yeast, Saccharomyces cerevisiae.一种经过改进、偏差降低的酿酒酵母概率功能基因网络。
PLoS One. 2007 Oct 3;2(10):e988. doi: 10.1371/journal.pone.0000988.
9
A primer on learning in Bayesian networks for computational biology.计算生物学中贝叶斯网络学习入门
PLoS Comput Biol. 2007 Aug;3(8):e129. doi: 10.1371/journal.pcbi.0030129.
10
A statistical method to incorporate biological knowledge for generating testable novel gene regulatory interactions from microarray experiments.一种整合生物学知识以从微阵列实验中生成可测试的新型基因调控相互作用的统计方法。
BMC Bioinformatics. 2007 Aug 29;8:317. doi: 10.1186/1471-2105-8-317.