MetaPath: identifying differentially abundant metabolic pathways in metagenomic datasets.

Authors

Liu Bo, Pop Mihai

Affiliation

Center for Bioinformatics and Computational Biology, Institute for Advanced Computer Studies, University of Maryland, College Park, MD 20742, USA.

Publication information

BMC Proc. 2011 May 28;5 Suppl 2(Suppl 2):S9. doi: 10.1186/1753-6561-5-S2-S9.

DOI:10.1186/1753-6561-5-S2-S9

PMID:21554767

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3090767/

Abstract

BACKGROUND

Enabled by rapid advances in sequencing technology, metagenomic studies aim to characterize entire communities of microbes, bypassing the need to culture individual bacterial members. One major goal of metagenomic studies is to identify specific functional adaptations of microbial communities to their habitats. The functional profile and the abundances for a sample can be estimated by mapping metagenomic sequences to the global metabolic network, which consists of thousands of molecular reactions. Here we describe a powerful analytical method (MetaPath) that can identify differentially abundant pathways in metagenomic datasets, relying on a combination of metagenomic sequence data and prior metabolic pathway knowledge.

METHODS

First, we introduce a scoring function for an arbitrary subnetwork and find the max-weight subnetwork in the global network using a greedy search algorithm. Then we compute two p-values (p_abund and p_struct) using nonparametric approaches to answer two different statistical questions: (1) Is this subnetwork differentially abundant? (2) What is the probability of finding such good subnetworks by chance, given the data and network structure? Finally, significant metabolic subnetworks are identified based on these two p-values.
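
The three steps above can be illustrated with a minimal Python sketch. It is not the MetaPath implementation: the abstract does not specify the scoring function or the resampling schemes, so everything here is an assumption made for illustration. Node scores are absolute Welch t-statistics, the subnetwork score is the sum of node scores divided by the square root of the subnetwork size, p_abund is estimated by permuting sample labels, and p_struct by shuffling node scores over the fixed network and repeating the search. All names (node_stat, subnet_score, greedy_search, p_abund, p_struct) and the toy data are hypothetical.

```python
import math
import random
from statistics import mean, variance

def node_stat(a, b):
    """Assumed node score: absolute Welch t-statistic between two groups."""
    se = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
    return abs(mean(a) - mean(b)) / se if se > 0 else 0.0

def weights(abund, labels):
    """Score every reaction from its per-sample abundances and 0/1 group labels."""
    w = {}
    for node, vals in abund.items():
        a = [x for x, g in zip(vals, labels) if g == 0]
        b = [x for x, g in zip(vals, labels) if g == 1]
        w[node] = node_stat(a, b)
    return w

def subnet_score(nodes, w):
    """Assumed subnetwork score: sum of node scores scaled by sqrt(size)."""
    return sum(w[n] for n in nodes) / math.sqrt(len(nodes))

def greedy_search(graph, w):
    """Grow a connected subnetwork from each seed; return the best one found."""
    best, best_score = None, float("-inf")
    for seed in graph:
        sub, score = {seed}, subnet_score({seed}, w)
        while True:
            frontier = sorted({v for u in sub for v in graph[u]} - sub)
            if not frontier:
                break
            cand = max(frontier, key=lambda v: subnet_score(sub | {v}, w))
            new_score = subnet_score(sub | {cand}, w)
            if new_score <= score:  # stop when no neighbor improves the score
                break
            sub.add(cand)
            score = new_score
        if score > best_score:
            best, best_score = frozenset(sub), score
    return best, best_score

def p_abund(sub, abund, labels, n_perm=200, rng=None):
    """Label permutation: is this fixed subnetwork differentially abundant?"""
    rng = rng or random.Random(0)
    observed = subnet_score(sub, weights(abund, labels))
    hits = 0
    for _ in range(n_perm):
        perm = labels[:]
        rng.shuffle(perm)
        hits += subnet_score(sub, weights(abund, perm)) >= observed
    return (hits + 1) / (n_perm + 1)

def p_struct(graph, w, observed, n_perm=200, rng=None):
    """Score shuffling: how often does the search do this well by chance?"""
    rng = rng or random.Random(0)
    nodes = list(graph)
    hits = 0
    for _ in range(n_perm):
        vals = [w[n] for n in nodes]
        rng.shuffle(vals)
        hits += greedy_search(graph, dict(zip(nodes, vals)))[1] >= observed
    return (hits + 1) / (n_perm + 1)

# Toy data (hypothetical): a path of five reactions, four samples per group.
graph = {"R1": ["R2"], "R2": ["R1", "R3"], "R3": ["R2", "R4"],
         "R4": ["R3", "R5"], "R5": ["R4"]}
labels = [0, 0, 0, 0, 1, 1, 1, 1]
abund = {"R1": [10, 11, 9, 10, 20, 21, 19, 20],
         "R2": [5, 6, 4, 5, 12, 13, 11, 12],
         "R3": [8, 9, 7, 8, 15, 14, 16, 15],
         "R4": [10, 12, 8, 10, 11, 12, 9, 10],
         "R5": [7, 9, 8, 8, 8, 7, 9, 8]}

w = weights(abund, labels)
best, best_score = greedy_search(graph, w)
pa = p_abund(best, abund, labels)
ps = p_struct(graph, w, best_score)
print(sorted(best), round(best_score, 2), pa, ps)
```

On this toy input the search recovers the three reactions whose abundances differ between groups. Because the network has only five nodes, shuffled scores often land next to each other by chance, so p_struct is not expected to be small here; the structural test is only informative on networks of realistic size.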

RESULTS

In order to validate our methods, we have designed a simulated metabolic pathways dataset and show that MetaPath outperforms other commonly used approaches. We also demonstrate the power of our methods in analyzing two publicly available metagenomic datasets, and show that the subnetworks identified by MetaPath provide valuable insights into the biological activities of the microbiome.

CONCLUSIONS

We have introduced a statistical method for finding significant metabolic subnetworks from metagenomic datasets. Compared with previous methods, results from MetaPath are more robust against noise in the data, and have significantly higher sensitivity and specificity (when tested on simulated datasets). When applied to two publicly available metagenomic datasets, the output of MetaPath is consistent with previous observations and also provides several new insights into the metabolic activity of the gut microbiome. The software is freely available at http://metapath.cbcb.umd.edu.
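
For readers unfamiliar with the two evaluation measures named above, the following minimal sketch shows how sensitivity and specificity could be computed when a simulated dataset has a known ("planted") set of differentially abundant reactions. The function name and the example sets are hypothetical and are not taken from the paper's simulation.

```python
def sensitivity_specificity(predicted, truth, universe):
    """Compare a predicted set of reactions with the planted (true) set."""
    tp = len(predicted & truth)             # planted reactions that were found
    fn = len(truth - predicted)             # planted reactions that were missed
    fp = len(predicted - truth)             # reported reactions that are not planted
    tn = len(universe - predicted - truth)  # correctly ignored reactions
    sens = tp / (tp + fn) if tp + fn else float("nan")
    spec = tn / (tn + fp) if tn + fp else float("nan")
    return sens, spec

# Hypothetical example: 10 reactions, 4 planted, 4 reported (3 of them correct).
universe = {f"R{i}" for i in range(1, 11)}
truth = {"R1", "R2", "R3", "R4"}
predicted = {"R1", "R2", "R3", "R5"}
sens, spec = sensitivity_specificity(predicted, truth, universe)
print(sens, spec)
```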
