• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

马尔可夫链蒙特卡罗在主动模块识别问题中的应用。

Markov chain Monte Carlo for active module identification problem.

机构信息

Computer Technologies Laboratory, ITMO University, Saint Petersburg, Russia.

JetBrains Research, Saint Petersburg, Russia.

出版信息

BMC Bioinformatics. 2020 Nov 18;21(Suppl 6):261. doi: 10.1186/s12859-020-03572-9.

DOI:10.1186/s12859-020-03572-9
PMID:33203350
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7672893/
Abstract

BACKGROUND

Integrative network methods are commonly used for interpretation of high-throughput experimental biological data: transcriptomics, proteomics, metabolomics and others. One of the common approaches is finding a connected subnetwork of a global interaction network that best encompasses significant individual changes in the data and represents a so-called active module. Usually methods implementing this approach find a single subnetwork and thus solve a hard classification problem for vertices. This subnetwork inherently contains erroneous vertices, while no instrument is provided to estimate the confidence level of any particular vertex inclusion. To address this issue, in the current study we consider the active module problem as a soft classification problem.

RESULTS

We propose a method to estimate probabilities of each vertex to belong to the active module based on Markov chain Monte Carlo (MCMC) subnetwork sampling. As an example of the performance of our method on real data, we run it on two gene expression datasets. For the first many-replicate expression dataset we show that the proposed approach is consistent with an existing resampling-based method. On the second dataset the jackknife resampling method is inapplicable due to the small number of biological replicates, but the MCMC method can be run and shows high classification performance.

CONCLUSIONS

The proposed method allows to estimate the probability that an individual vertex belongs to the active module as well as the false discovery rate (FDR) for a given set of vertices. Given the estimated probabilities, it becomes possible to provide a connected subgraph in a consistent manner for any given FDR level: no vertex can disappear when the FDR level is relaxed. We show, on both simulated and real datasets, that the proposed method has good computational performance and high classification accuracy.

摘要

背景

整合网络方法常用于解释高通量实验生物学数据:转录组学、蛋白质组学、代谢组学等。常见的方法之一是找到全局相互作用网络的一个连通子网络,该子网络最好包含数据中显著的个体变化,并代表所谓的活性模块。通常,实现此方法的方法会找到单个子网,从而为顶点解决硬分类问题。该子网本质上包含错误的顶点,而没有仪器可以估计任何特定顶点包含的置信度水平。为了解决这个问题,在当前的研究中,我们将活性模块问题视为软分类问题。

结果

我们提出了一种基于马尔可夫链蒙特卡罗(MCMC)子网采样来估计每个顶点属于活性模块的概率的方法。作为我们的方法在真实数据上的性能的一个例子,我们在两个基因表达数据集上运行它。对于第一个具有许多重复表达的数据集,我们表明所提出的方法与基于现有重采样的方法一致。对于第二个数据集,由于生物学重复数较少,因此无法使用自举重采样方法,但可以运行 MCMC 方法,并且显示出较高的分类性能。

结论

所提出的方法允许估计个体顶点属于活性模块的概率以及给定顶点集的错误发现率(FDR)。给定估计的概率,就可以以一致的方式为任何给定的 FDR 水平提供一个连通的子图:当 FDR 水平放宽时,没有顶点会消失。我们在模拟和真实数据集上都表明,所提出的方法具有良好的计算性能和高分类精度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/cb618dcda34a/12859_2020_3572_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/a668de09ea5f/12859_2020_3572_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/d67abb3d1952/12859_2020_3572_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/6dd44bae4cf1/12859_2020_3572_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/6ef27db608e2/12859_2020_3572_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/6a252150c009/12859_2020_3572_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/43575fd3ca41/12859_2020_3572_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/cb618dcda34a/12859_2020_3572_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/a668de09ea5f/12859_2020_3572_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/d67abb3d1952/12859_2020_3572_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/6dd44bae4cf1/12859_2020_3572_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/6ef27db608e2/12859_2020_3572_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/6a252150c009/12859_2020_3572_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/43575fd3ca41/12859_2020_3572_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ef5/7672893/cb618dcda34a/12859_2020_3572_Fig7_HTML.jpg

相似文献

1
Markov chain Monte Carlo for active module identification problem.马尔可夫链蒙特卡罗在主动模块识别问题中的应用。
BMC Bioinformatics. 2020 Nov 18;21(Suppl 6):261. doi: 10.1186/s12859-020-03572-9.
2
Guided tree topology proposals for Bayesian phylogenetic inference.贝叶斯系统发育推断的引导树拓扑提议。
Syst Biol. 2012 Jan;61(1):1-11. doi: 10.1093/sysbio/syr074. Epub 2011 Aug 9.
3
Markov chain Monte Carlo simulation of a Bayesian mixture model for gene network inference.贝叶斯混合模型的马尔可夫链蒙特卡罗模拟在基因网络推断中的应用。
Genes Genomics. 2019 May;41(5):547-555. doi: 10.1007/s13258-019-00789-8. Epub 2019 Feb 11.
4
A simple introduction to Markov Chain Monte-Carlo sampling.马尔可夫链蒙特卡罗采样简介。
Psychon Bull Rev. 2018 Feb;25(1):143-154. doi: 10.3758/s13423-016-1015-8.
5
Searching for convergence in phylogenetic Markov chain Monte Carlo.在系统发育马尔可夫链蒙特卡罗方法中寻找收敛性。
Syst Biol. 2006 Aug;55(4):553-65. doi: 10.1080/10635150600812544.
6
Comprehensive benchmarking of Markov chain Monte Carlo methods for dynamical systems.动态系统马尔可夫链蒙特卡罗方法的综合基准测试
BMC Syst Biol. 2017 Jun 24;11(1):63. doi: 10.1186/s12918-017-0433-1.
7
Connecting PM and MAP in Bayesian spectral deconvolution by extending exchange Monte Carlo method and using multiple data sets.通过扩展交换蒙特卡罗方法并使用多个数据集,将 PM 和 MAP 连接到贝叶斯谱分解中。
Neural Netw. 2019 Oct;118:159-166. doi: 10.1016/j.neunet.2019.05.004. Epub 2019 May 15.
8
Inference of regulatory networks with a convergence improved MCMC sampler.使用收敛性改进的马尔可夫链蒙特卡罗采样器推断调控网络。
BMC Bioinformatics. 2015 Sep 24;16:306. doi: 10.1186/s12859-015-0734-6.
9
Sequential Gibbs Sampling Algorithm for Cognitive Diagnosis Models with Many Attributes.具有多个属性的认知诊断模型的序贯 Gibbs 抽样算法。
Multivariate Behav Res. 2022 Sep-Oct;57(5):840-858. doi: 10.1080/00273171.2021.1896352. Epub 2021 Mar 23.
10
Bayesian models and Markov chain Monte Carlo methods for protein motifs with the secondary characteristics.具有二级特征的蛋白质基序的贝叶斯模型和马尔可夫链蒙特卡罗方法。
J Comput Biol. 2005 Sep;12(7):952-70. doi: 10.1089/cmb.2005.12.952.

引用本文的文献

1
Covariate dependent Markov chains constructed with gradient boost modeling can effectively generate long-term predictions of obesity trends.基于梯度提升建模构建的协变量相关马尔可夫链,可以有效地生成肥胖趋势的长期预测。
BMC Res Notes. 2023 Nov 24;16(1):346. doi: 10.1186/s13104-023-06610-w.
2
Histology and transcriptome insights into the early processes of intestinal anastomotic healing: a rat model.组织学和转录组学对肠道吻合口愈合早期过程的研究:大鼠模型。
BJS Open. 2023 Sep 5;7(5). doi: 10.1093/bjsopen/zrad099.
3
Transcriptomic Signatures of Human Immunodeficiency Virus Post-Treatment Control.

本文引用的文献

1
Identifying significantly impacted pathways: a comprehensive review and assessment.识别受显著影响的途径:全面回顾与评估。
Genome Biol. 2019 Oct 9;20(1):203. doi: 10.1186/s13059-019-1790-4.
2
A Comprehensive Survey of Tools and Software for Active Subnetwork Identification.用于活跃子网识别的工具和软件综合调查。
Front Genet. 2019 Mar 5;10:155. doi: 10.3389/fgene.2019.00155. eCollection 2019.
3
A scored human protein-protein interaction network to catalyze genomic interpretation.一个用于催化基因组解读的评分人类蛋白质-蛋白质相互作用网络。
人类免疫缺陷病毒治疗后控制的转录组特征。
J Virol. 2023 Jan 31;97(1):e0125422. doi: 10.1128/jvi.01254-22. Epub 2022 Dec 21.
Nat Methods. 2017 Jan;14(1):61-64. doi: 10.1038/nmeth.4083. Epub 2016 Nov 28.
4
Oxygen regulates proliferation of neural stem cells through Wnt/β-catenin signalling.氧气通过Wnt/β-连环蛋白信号通路调节神经干细胞的增殖。
Mol Cell Neurosci. 2015 Jul;67:84-92. doi: 10.1016/j.mcn.2015.06.006. Epub 2015 Jun 14.
5
Network integration of parallel metabolic and transcriptional data reveals metabolic modules that regulate macrophage polarization.平行代谢和转录组数据的网络整合揭示了调节巨噬细胞极化的代谢模块。
Immunity. 2015 Mar 17;42(3):419-30. doi: 10.1016/j.immuni.2015.02.005.
6
Pan-cancer network analysis identifies combinations of rare somatic mutations across pathways and protein complexes.泛癌网络分析确定了跨通路和蛋白质复合物的罕见体细胞突变组合。
Nat Genet. 2015 Feb;47(2):106-14. doi: 10.1038/ng.3168. Epub 2014 Dec 15.
7
Integrative approaches for finding modular structure in biological networks.综合方法寻找生物网络中的模块结构。
Nat Rev Genet. 2013 Oct;14(10):719-32. doi: 10.1038/nrg3552.
8
Network enrichment analysis: extension of gene-set enrichment analysis to gene networks.网络富集分析:基因集富集分析向基因网络的扩展。
BMC Bioinformatics. 2012 Sep 11;13:226. doi: 10.1186/1471-2105-13-226.
9
Robustness and accuracy of functional modules in integrated network analysis.综合网络分析中功能模块的稳健性和准确性。
Bioinformatics. 2012 Jul 15;28(14):1887-94. doi: 10.1093/bioinformatics/bts265. Epub 2012 May 11.
10
Differential network biology.差异网络生物学。
Mol Syst Biol. 2012 Jan 17;8:565. doi: 10.1038/msb.2011.99.