• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iPRESTO:与特定天然产物亚结构相关的生物合成亚簇的自动发现。

iPRESTO: Automated discovery of biosynthetic sub-clusters linked to specific natural product substructures.

机构信息

Bioinformatics Group, Wageningen University, Wageningen, the Netherlands.

Netherlands eScience Center, Amsterdam, the Netherlands.

出版信息

PLoS Comput Biol. 2023 Feb 9;19(2):e1010462. doi: 10.1371/journal.pcbi.1010462. eCollection 2023 Feb.

DOI:10.1371/journal.pcbi.1010462
PMID:36758069
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9946207/
Abstract

Microbial specialised metabolism is full of valuable natural products that are applied clinically, agriculturally, and industrially. The genes that encode their biosynthesis are often physically clustered on the genome in biosynthetic gene clusters (BGCs). Many BGCs consist of multiple groups of co-evolving genes called sub-clusters that are responsible for the biosynthesis of a specific chemical moiety in a natural product. Sub-clusters therefore provide an important link between the structures of a natural product and its BGC, which can be leveraged for predicting natural product structures from sequence, as well as for linking chemical structures and metabolomics-derived mass features to BGCs. While some initial computational methodologies have been devised for sub-cluster detection, current approaches are not scalable, have only been run on small and outdated datasets, or produce an impractically large number of possible sub-clusters to mine through. Here, we constructed a scalable method for unsupervised sub-cluster detection, called iPRESTO, based on topic modelling and statistical analysis of co-occurrence patterns of enzyme-coding protein families. iPRESTO was used to mine sub-clusters across 150,000 prokaryotic BGCs from antiSMASH-DB. After annotating a fraction of the resulting sub-cluster families, we could predict a substructure for 16% of the antiSMASH-DB BGCs. Additionally, our method was able to confirm 83% of the experimentally characterised sub-clusters in MIBiG reference BGCs. Based on iPRESTO-detected sub-clusters, we could correctly identify the BGCs for xenorhabdin and salbostatin biosynthesis (which had not yet been annotated in BGC databases), as well as propose a candidate BGC for akashin biosynthesis. Additionally, we show for a collection of 145 actinobacteria how substructures can aid in linking BGCs to molecules by correlating iPRESTO-detected sub-clusters to MS/MS-derived Mass2Motifs substructure patterns. This work paves the way for deeper functional and structural annotation of microbial BGCs by improved linking of orphan molecules to their cognate gene clusters, thus facilitating accelerated natural product discovery.

摘要

微生物特化代谢充满了有价值的天然产物,这些产物在临床、农业和工业中得到应用。编码其生物合成的基因通常在基因组上物理聚集在生物合成基因簇 (BGCs) 中。许多 BGC 由多个协同进化的基因群组成,称为亚簇,负责天然产物中特定化学部分的生物合成。因此,亚簇为天然产物的结构与其 BGC 之间提供了重要的联系,这可以用于从序列预测天然产物的结构,以及将化学结构和代谢组学衍生的质量特征与 BGC 联系起来。虽然已经设计了一些初始的计算方法来检测亚簇,但当前的方法不可扩展,仅在小型和过时的数据集上运行,或者产生大量不切实际的可能亚簇来挖掘。在这里,我们构建了一种基于主题建模和酶编码蛋白家族共现模式的统计分析的无监督亚簇检测方法,称为 iPRESTO。iPRESTO 用于挖掘来自 antiSMASH-DB 的 150,000 个原核 BGC 中的亚簇。在注释一部分由此产生的亚簇家族之后,我们可以预测 16%的 antiSMASH-DB BGC 的亚结构。此外,我们的方法能够确认 MIBiG 参考 BGC 中 83%的实验表征的亚簇。基于 iPRESTO 检测到的亚簇,我们可以正确识别 xenorhabdin 和 salbostatin 生物合成的 BGC(这些 BGC 尚未在 BGC 数据库中注释),并提出 akashin 生物合成的候选 BGC。此外,我们展示了 145 种放线菌如何通过将 iPRESTO 检测到的亚簇与 MS/MS 衍生的 Mass2Motifs 亚结构模式相关联,来帮助将 BGC 与分子联系起来。这项工作通过改进将孤儿分子与其同源基因簇联系起来,为微生物 BGC 的更深入功能和结构注释铺平了道路,从而促进了加速天然产物的发现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d088/9946207/747ffebfdce7/pcbi.1010462.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d088/9946207/55ee4f086184/pcbi.1010462.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d088/9946207/c7a1da7b812a/pcbi.1010462.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d088/9946207/f08f81c71128/pcbi.1010462.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d088/9946207/41b43676892c/pcbi.1010462.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d088/9946207/39c2515f81e8/pcbi.1010462.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d088/9946207/747ffebfdce7/pcbi.1010462.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d088/9946207/55ee4f086184/pcbi.1010462.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d088/9946207/c7a1da7b812a/pcbi.1010462.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d088/9946207/f08f81c71128/pcbi.1010462.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d088/9946207/41b43676892c/pcbi.1010462.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d088/9946207/39c2515f81e8/pcbi.1010462.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d088/9946207/747ffebfdce7/pcbi.1010462.g006.jpg

相似文献

1
iPRESTO: Automated discovery of biosynthetic sub-clusters linked to specific natural product substructures.iPRESTO:与特定天然产物亚结构相关的生物合成亚簇的自动发现。
PLoS Comput Biol. 2023 Feb 9;19(2):e1010462. doi: 10.1371/journal.pcbi.1010462. eCollection 2023 Feb.
2
Mining metagenomic data to gain a new insight into the gut microbial biosynthetic potential in placental mammals.从宏基因组数据中挖掘新的见解,以了解胎盘哺乳动物肠道微生物的生物合成潜力。
Microbiol Spectr. 2024 Oct 3;12(10):e0086424. doi: 10.1128/spectrum.00864-24. Epub 2024 Aug 20.
3
Enhanced correlation-based linking of biosynthetic gene clusters to their metabolic products through chemical class matching.通过化学分类匹配增强生物合成基因簇与其代谢产物的相关性链接。
Microbiome. 2023 Jan 23;11(1):13. doi: 10.1186/s40168-022-01444-3.
4
Recent development of antiSMASH and other computational approaches to mine secondary metabolite biosynthetic gene clusters.抗 SMASH 及其他计算方法在挖掘次级代谢产物生物合成基因簇方面的最新进展。
Brief Bioinform. 2019 Jul 19;20(4):1103-1113. doi: 10.1093/bib/bbx146.
5
Comprehensive curation and analysis of fungal biosynthetic gene clusters of published natural products.已发表天然产物的真菌生物合成基因簇的综合整理与分析
Fungal Genet Biol. 2016 Apr;89:18-28. doi: 10.1016/j.fgb.2016.01.012. Epub 2016 Jan 22.
6
TaxiBGC: a Taxonomy-Guided Approach for Profiling Experimentally Characterized Microbial Biosynthetic Gene Clusters and Secondary Metabolite Production Potential in Metagenomes.TaxiBGC:一种基于分类学的方法,用于对宏基因组中实验表征的微生物生物合成基因簇和次生代谢产物产生潜力进行分析。
mSystems. 2022 Dec 20;7(6):e0092522. doi: 10.1128/msystems.00925-22. Epub 2022 Nov 15.
7
Predicting fungal secondary metabolite activity from biosynthetic gene cluster data using machine learning.基于生物合成基因簇数据利用机器学习预测真菌次生代谢物活性。
Microbiol Spectr. 2024 Feb 6;12(2):e0340023. doi: 10.1128/spectrum.03400-23. Epub 2024 Jan 9.
8
MIBiG 3.0: a community-driven effort to annotate experimentally validated biosynthetic gene clusters.MIBiG 3.0:一个社区驱动的努力,用于注释经过实验验证的生物合成基因簇。
Nucleic Acids Res. 2023 Jan 6;51(D1):D603-D610. doi: 10.1093/nar/gkac1049.
9
Evolution and Diversity of Biosynthetic Gene Clusters in .……中生物合成基因簇的进化与多样性 (原文不完整,缺少具体所指对象)
Front Microbiol. 2018 Jun 5;9:1158. doi: 10.3389/fmicb.2018.01158. eCollection 2018.
10
Correlative metabologenomics of 110 fungi reveals metabolite-gene cluster pairs.110 种真菌的相关代谢组学和基因簇组对揭示代谢物。
Nat Chem Biol. 2023 Jul;19(7):846-854. doi: 10.1038/s41589-023-01276-8. Epub 2023 Mar 6.

引用本文的文献

1
Context matters: assessing the impacts of genomic background and ecology on microbial biosynthetic gene cluster evolution.背景很重要:评估基因组背景和生态学对微生物生物合成基因簇进化的影响。
mSystems. 2025 Mar 18;10(3):e0153824. doi: 10.1128/msystems.01538-24. Epub 2025 Feb 24.
2
CLOCI: unveiling cryptic fungal gene clusters with generalized detection.CLOCI:利用广义检测揭示隐匿真菌基因簇。
Nucleic Acids Res. 2024 Sep 9;52(16):e75. doi: 10.1093/nar/gkae625.
3
A Comprehensive Self-Resistance Gene Database for Natural-Product Discovery with an Application to Marine Bacterial Genome Mining.

本文引用的文献

1
Enhanced correlation-based linking of biosynthetic gene clusters to their metabolic products through chemical class matching.通过化学分类匹配增强生物合成基因簇与其代谢产物的相关性链接。
Microbiome. 2023 Jan 23;11(1):13. doi: 10.1186/s40168-022-01444-3.
2
Comprehensive Large-Scale Integrative Analysis of Omics Data To Accelerate Specialized Metabolite Discovery.用于加速特殊代谢物发现的组学数据综合大规模整合分析
mSystems. 2021 Aug 31;6(4):e0072621. doi: 10.1128/mSystems.00726-21. Epub 2021 Aug 24.
3
antiSMASH 6.0: improving cluster detection and comparison capabilities.
用于天然产物发现的综合自我抗性基因数据库及其在海洋细菌基因组挖掘中的应用
Int J Mol Sci. 2023 Aug 4;24(15):12446. doi: 10.3390/ijms241512446.
4
NPOmix: A machine learning classifier to connect mass spectrometry fragmentation data to biosynthetic gene clusters.NPOmix:一种将质谱碎裂数据与生物合成基因簇相联系的机器学习分类器。
PNAS Nexus. 2022 Nov 16;1(5):pgac257. doi: 10.1093/pnasnexus/pgac257. eCollection 2022 Nov.
5
Enhanced correlation-based linking of biosynthetic gene clusters to their metabolic products through chemical class matching.通过化学分类匹配增强生物合成基因簇与其代谢产物的相关性链接。
Microbiome. 2023 Jan 23;11(1):13. doi: 10.1186/s40168-022-01444-3.
antiSMASH 6.0:提高簇检测和比较能力。
Nucleic Acids Res. 2021 Jul 2;49(W1):W29-W35. doi: 10.1093/nar/gkab335.
4
Ranking microbial metabolomic and genomic links in the NPLinker framework using complementary scoring functions.在 NPLinker 框架中使用互补评分函数对微生物代谢组学和基因组学关联进行排名。
PLoS Comput Biol. 2021 May 4;17(5):e1008920. doi: 10.1371/journal.pcbi.1008920. eCollection 2021 May.
5
BiG-SLiCE: A highly scalable tool maps the diversity of 1.2 million biosynthetic gene clusters.BiG-SLiCE:一个高度可扩展的工具,可绘制 120 万个生物合成基因簇的多样性图谱。
Gigascience. 2021 Jan 13;10(1). doi: 10.1093/gigascience/giaa154.
6
Comprehensive prediction of secondary metabolite structure and biological activity from microbial genome sequences.从微生物基因组序列中综合预测次生代谢物结构和生物活性。
Nat Commun. 2020 Nov 27;11(1):6058. doi: 10.1038/s41467-020-19986-1.
7
The antiSMASH database version 3: increased taxonomic coverage and new query features for modular enzymes.反 SMASH 数据库版本 3:增加了分类学覆盖范围和新的模块酶查询功能。
Nucleic Acids Res. 2021 Jan 8;49(D1):D639-D643. doi: 10.1093/nar/gkaa978.
8
Linking genomics and metabolomics to chart specialized metabolic diversity.将基因组学和代谢组学联系起来,以绘制专门代谢多样性图谱。
Chem Soc Rev. 2020 Jun 7;49(11):3297-3314. doi: 10.1039/d0cs00162g. Epub 2020 May 12.
9
The Natural Products Atlas: An Open Access Knowledge Base for Microbial Natural Products Discovery.《天然产物图谱:微生物天然产物发现的开放获取知识库》
ACS Cent Sci. 2019 Nov 27;5(11):1824-1833. doi: 10.1021/acscentsci.9b00806. Epub 2019 Nov 14.
10
A computational framework to explore large-scale biosynthetic diversity.用于探索大规模生物合成多样性的计算框架。
Nat Chem Biol. 2020 Jan;16(1):60-68. doi: 10.1038/s41589-019-0400-9. Epub 2019 Nov 25.