• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TaxiBGC:一种基于分类学的方法,用于对宏基因组中实验表征的微生物生物合成基因簇和次生代谢产物产生潜力进行分析。

TaxiBGC: a Taxonomy-Guided Approach for Profiling Experimentally Characterized Microbial Biosynthetic Gene Clusters and Secondary Metabolite Production Potential in Metagenomes.

机构信息

Microbiome Program, Center for Individualized Medicine, Mayo Clinicgrid.66875.3a, Rochester, Minnesota, USA.

Division of Surgery Research, Department of Surgery, Mayo Clinicgrid.66875.3a, Rochester, Minnesota, USA.

出版信息

mSystems. 2022 Dec 20;7(6):e0092522. doi: 10.1128/msystems.00925-22. Epub 2022 Nov 15.

DOI:10.1128/msystems.00925-22
PMID:36378489
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9765181/
Abstract

Biosynthetic gene clusters (BGCs) in microbial genomes encode bioactive secondary metabolites (SMs), which can play important roles in microbe-microbe and host-microbe interactions. Given the biological significance of SMs and the current profound interest in the metabolic functions of microbiomes, the unbiased identification of BGCs from high-throughput metagenomic data could offer novel insights into the complex chemical ecology of microbial communities. Currently available tools for predicting BGCs from shotgun metagenomes have several limitations, including the need for computationally demanding read assembly, predicting a narrow breadth of BGC classes, and not providing the SM product. To overcome these limitations, we developed onomy-guided dentification of iosynthetic ene lusters (TaxiBGC), a command-line tool for predicting experimentally characterized BGCs (and inferring their known SMs) in metagenomes by first pinpointing the microbial species likely to harbor them. We benchmarked TaxiBGC on various simulated metagenomes, showing that our taxonomy-guided approach could predict BGCs with much-improved performance (mean F score, 0.56; mean PPV score, 0.80) compared with directly identifying BGCs by mapping sequencing reads onto the BGC genes (mean F score, 0.49; mean PPV score, 0.41). Next, by applying TaxiBGC on 2,650 metagenomes from the Human Microbiome Project and various case-control gut microbiome studies, we were able to associate BGCs (and their SMs) with different human body sites and with multiple diseases, including Crohn's disease and liver cirrhosis. In all, TaxiBGC provides an platform to predict experimentally characterized BGCs and their SM production potential in metagenomic data while demonstrating important advantages over existing techniques. Currently available bioinformatics tools to identify BGCs from metagenomic sequencing data are limited in their predictive capability or ease of use to even computationally oriented researchers. We present an automated computational pipeline called TaxiBGC, which predicts experimentally characterized BGCs (and infers their known SMs) in shotgun metagenomes by first considering the microbial species source. Through rigorous benchmarking techniques on simulated metagenomes, we show that TaxiBGC provides a significant advantage over existing methods. When demonstrating TaxiBGC on thousands of human microbiome samples, we associate BGCs encoding bacteriocins with different human body sites and diseases, thereby elucidating a possible novel role of this antibiotic class in maintaining the stability of microbial ecosystems throughout the human body. Furthermore, we report for the first time gut microbial BGC associations shared among multiple pathologies. Ultimately, we expect our tool to facilitate future investigations into the chemical ecology of microbial communities across diverse niches and pathologies.

摘要

生物合成基因簇(BGCs)在微生物基因组中编码生物活性的次生代谢产物(SMs),这些产物在微生物-微生物和宿主-微生物的相互作用中起着重要作用。鉴于 SMs 的生物学意义以及目前对微生物组代谢功能的浓厚兴趣,从高通量宏基因组数据中无偏地识别 BGCs 可以为微生物群落的复杂化学生态提供新的见解。目前用于从 shotgun 宏基因组中预测 BGCs 的工具存在一些局限性,包括需要计算量大的读取组装、预测 BGC 类的范围较窄,以及不提供 SM 产物。为了克服这些限制,我们开发了基于分类学的生物合成基因簇识别工具(TaxiBGC),这是一种命令行工具,用于通过首先确定可能携带它们的微生物物种,在宏基因组中预测经过实验表征的 BGCs(并推断它们已知的 SMs)。我们在各种模拟的宏基因组上对 TaxiBGC 进行了基准测试,结果表明,与直接通过将测序reads 映射到 BGC 基因上来识别 BGCs 相比,我们的分类学指导方法可以显著提高 BGCs 的预测性能(平均 F 分数为 0.56;平均 PPV 分数为 0.80)。接下来,通过将 TaxiBGC 应用于来自人类微生物组计划和各种病例对照肠道微生物组研究的 2650 个宏基因组,我们能够将 BGCs(及其 SMs)与不同的人体部位和多种疾病相关联,包括克罗恩病和肝硬化。总之,TaxiBGC 提供了一个平台,可以在宏基因组数据中预测经过实验表征的 BGCs 及其 SM 产生潜力,同时与现有技术相比具有重要优势。目前可用于从宏基因组测序数据中识别 BGCs 的生物信息学工具在预测能力或甚至对计算导向的研究人员的易用性方面都存在局限性。我们提出了一种名为 TaxiBGC 的自动化计算管道,该管道通过首先考虑微生物物种来源,预测 shotgun 宏基因组中的实验表征 BGCs(并推断它们已知的 SMs)。通过对模拟宏基因组进行严格的基准测试技术,我们表明 TaxiBGC 提供了比现有方法显著的优势。在对数千个人类微生物组样本进行 TaxiBGC 演示时,我们将编码细菌素的 BGCs 与不同的人体部位和疾病相关联,从而阐明了这种抗生素类在维持人体微生物生态系统稳定性方面的可能新作用。此外,我们首次报告了多种病理情况下肠道微生物 BGC 关联。最终,我们希望我们的工具能够促进未来对不同生态位和病理条件下微生物群落化学生态的研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a00/9765181/3a8078032180/msystems.00925-22-f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a00/9765181/16baadb5e174/msystems.00925-22-f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a00/9765181/d28013033bfe/msystems.00925-22-f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a00/9765181/841490c93df2/msystems.00925-22-f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a00/9765181/33c38e7e34ba/msystems.00925-22-f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a00/9765181/5db2ac7f31c8/msystems.00925-22-f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a00/9765181/3a8078032180/msystems.00925-22-f006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a00/9765181/16baadb5e174/msystems.00925-22-f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a00/9765181/d28013033bfe/msystems.00925-22-f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a00/9765181/841490c93df2/msystems.00925-22-f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a00/9765181/33c38e7e34ba/msystems.00925-22-f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a00/9765181/5db2ac7f31c8/msystems.00925-22-f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a00/9765181/3a8078032180/msystems.00925-22-f006.jpg

相似文献

1
TaxiBGC: a Taxonomy-Guided Approach for Profiling Experimentally Characterized Microbial Biosynthetic Gene Clusters and Secondary Metabolite Production Potential in Metagenomes.TaxiBGC:一种基于分类学的方法,用于对宏基因组中实验表征的微生物生物合成基因簇和次生代谢产物产生潜力进行分析。
mSystems. 2022 Dec 20;7(6):e0092522. doi: 10.1128/msystems.00925-22. Epub 2022 Nov 15.
2
Mining metagenomic data to gain a new insight into the gut microbial biosynthetic potential in placental mammals.从宏基因组数据中挖掘新的见解,以了解胎盘哺乳动物肠道微生物的生物合成潜力。
Microbiol Spectr. 2024 Oct 3;12(10):e0086424. doi: 10.1128/spectrum.00864-24. Epub 2024 Aug 20.
3
Long-Read Metagenome-Assembled Genomes Improve Identification of Novel Complete Biosynthetic Gene Clusters in a Complex Microbial Activated Sludge Ecosystem.长读长基因组组装提高了复杂微生物活性污泥生态系统中新型完整生物合成基因簇的鉴定。
mSystems. 2022 Dec 20;7(6):e0063222. doi: 10.1128/msystems.00632-22. Epub 2022 Nov 29.
4
Multi-site analysis of biosynthetic gene clusters from the periodontitis oral microbiome.牙周炎口腔微生物组生物合成基因簇的多地点分析。
J Med Microbiol. 2024 Oct;73(10). doi: 10.1099/jmm.0.001898.
5
Identification of the Bacterial Biosynthetic Gene Clusters of the Oral Microbiome Illuminates the Unexplored Social Language of Bacteria during Health and Disease.口腔微生物组的细菌生物合成基因簇的鉴定揭示了健康和疾病期间细菌未被探索的社会语言。
mBio. 2019 Apr 16;10(2):e00321-19. doi: 10.1128/mBio.00321-19.
6
A rapid and efficient strategy to identify and recover biosynthetic gene clusters from soil metagenomes.一种从土壤宏基因组中快速高效鉴定和回收生物合成基因簇的策略。
Appl Microbiol Biotechnol. 2022 Apr;106(8):3293-3306. doi: 10.1007/s00253-022-11917-y. Epub 2022 Apr 18.
7
Long-read metagenomics of soil communities reveals phylum-specific secondary metabolite dynamics.土壤群落的长读宏基因组学揭示了门特异性次生代谢物动态。
Commun Biol. 2021 Nov 18;4(1):1302. doi: 10.1038/s42003-021-02809-4.
8
A systematically biosynthetic investigation of lactic acid bacteria reveals diverse antagonistic bacteriocins that potentially shape the human microbiome.一项针对乳酸菌的系统性生物合成研究揭示了多样化的拮抗细菌素,这些细菌素可能会影响人类微生物组。
Microbiome. 2023 Apr 27;11(1):91. doi: 10.1186/s40168-023-01540-y.
9
Elucidating the Diversity and Potential Function of Nonribosomal Peptide and Polyketide Biosynthetic Gene Clusters in the Root Microbiome.解析根际微生物群中非核糖体肽和聚酮化合物生物合成基因簇的多样性及潜在功能
mSystems. 2020 Dec 22;5(6):e00866-20. doi: 10.1128/mSystems.00866-20.
10
Predicting fungal secondary metabolite activity from biosynthetic gene cluster data using machine learning.基于生物合成基因簇数据利用机器学习预测真菌次生代谢物活性。
Microbiol Spectr. 2024 Feb 6;12(2):e0340023. doi: 10.1128/spectrum.03400-23. Epub 2024 Jan 9.

引用本文的文献

1
Sequence modeling tools to decode the biosynthetic diversity of the human microbiome.用于解码人类微生物组生物合成多样性的序列建模工具。
mSystems. 2025 Jul 22;10(7):e0033325. doi: 10.1128/msystems.00333-25. Epub 2025 Jun 30.
2
New approaches to secondary metabolite discovery from anaerobic gut microbes.从厌氧肠道微生物中发现次生代谢产物的新方法。
Appl Microbiol Biotechnol. 2025 Jan 20;109(1):12. doi: 10.1007/s00253-024-13393-y.
3
Mining metagenomic data to gain a new insight into the gut microbial biosynthetic potential in placental mammals.

本文引用的文献

1
Stochastic block models reveal a robust nested pattern in healthy human gut microbiomes.随机块模型揭示了健康人类肠道微生物群中一种稳健的嵌套模式。
PNAS Nexus. 2022 May 23;1(3):pgac055. doi: 10.1093/pnasnexus/pgac055. eCollection 2022 Jul.
2
Identification of shared and disease-specific host gene-microbiome associations across human diseases using multi-omic integration.利用多组学整合技术鉴定人类疾病中共享和疾病特异性的宿主基因-微生物组关联。
Nat Microbiol. 2022 Jun;7(6):780-795. doi: 10.1038/s41564-022-01121-z. Epub 2022 May 16.
3
Resource-allocation constraint governs structure and function of microbial communities in metabolic modeling.
从宏基因组数据中挖掘新的见解,以了解胎盘哺乳动物肠道微生物的生物合成潜力。
Microbiol Spectr. 2024 Oct 3;12(10):e0086424. doi: 10.1128/spectrum.00864-24. Epub 2024 Aug 20.
4
Multi-omics in Crohn's disease: New insights from inside.克罗恩病的多组学研究:来自内部的新见解
Comput Struct Biotechnol J. 2023 May 13;21:3054-3072. doi: 10.1016/j.csbj.2023.05.010. eCollection 2023.
5
Evaluating the prebiotic effect of oligosaccharides on gut microbiome wellness using in vitro fecal fermentation.利用体外粪便发酵评估低聚糖对肠道微生物群健康的益生元效应。
NPJ Sci Food. 2023 May 9;7(1):18. doi: 10.1038/s41538-023-00195-1.
6
GMWI-webtool: a user-friendly browser application for assessing health through metagenomic gut microbiome profiling.GMWI-webtool:一款用户友好的浏览器应用程序,可通过宏基因组肠道微生物组分析评估健康状况。
Bioinformatics. 2023 Feb 3;39(2). doi: 10.1093/bioinformatics/btad061.
资源分配约束控制代谢建模中微生物群落的结构和功能。
Metab Eng. 2022 Mar;70:12-22. doi: 10.1016/j.ymben.2021.12.011. Epub 2022 Jan 3.
4
Benefiting from big data in natural products: importance of preserving foundational skills and prioritizing data quality.从天然产物的大数据中受益:重视基础技能的保留和数据质量的优先级。
Nat Prod Rep. 2021 Nov 17;38(11):1947-1953. doi: 10.1039/d1np00061f.
5
BiG-MAP: an Automated Pipeline To Profile Metabolic Gene Cluster Abundance and Expression in Microbiomes.BiG-MAP:一种用于分析微生物群落中代谢基因簇丰度和表达的自动化流程。
mSystems. 2021 Oct 26;6(5):e0093721. doi: 10.1128/mSystems.00937-21. Epub 2021 Sep 28.
6
Comprehensive analysis of bacteriocins in Streptococcus mutans.全面分析变异链球菌中的细菌素。
Sci Rep. 2021 Jun 21;11(1):12963. doi: 10.1038/s41598-021-92370-1.
7
The gutSMASH web server: automated identification of primary metabolic gene clusters from the gut microbiota.肠道 SMASH 网络服务器:从肠道微生物群中自动识别主要代谢基因簇。
Nucleic Acids Res. 2021 Jul 2;49(W1):W263-W270. doi: 10.1093/nar/gkab353.
8
Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with bioBakery 3.利用 bioBakery 3 整合具有分类学、功能和菌株水平特征的多样化微生物群落。
Elife. 2021 May 4;10:e65088. doi: 10.7554/eLife.65088.
9
A predictive index for health status using species-level gut microbiome profiling.利用物种水平的肠道微生物组谱进行健康状况预测的指标。
Nat Commun. 2020 Sep 15;11(1):4635. doi: 10.1038/s41467-020-18476-8.
10
Mini review: Genome mining approaches for the identification of secondary metabolite biosynthetic gene clusters in .简短综述:用于鉴定……中次生代谢物生物合成基因簇的基因组挖掘方法
Comput Struct Biotechnol J. 2020 Jun 21;18:1548-1556. doi: 10.1016/j.csbj.2020.06.024. eCollection 2020.