• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在 NPLinker 框架中使用互补评分函数对微生物代谢组学和基因组学关联进行排名。

Ranking microbial metabolomic and genomic links in the NPLinker framework using complementary scoring functions.

机构信息

School of Computing Science, University of Glasgow, Glasgow, United Kingdom.

Bioinformatics Group, Wageningen University, Wageningen, The Netherlands.

出版信息

PLoS Comput Biol. 2021 May 4;17(5):e1008920. doi: 10.1371/journal.pcbi.1008920. eCollection 2021 May.

DOI:10.1371/journal.pcbi.1008920
PMID:33945539
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8130963/
Abstract

Specialised metabolites from microbial sources are well-known for their wide range of biomedical applications, particularly as antibiotics. When mining paired genomic and metabolomic data sets for novel specialised metabolites, establishing links between Biosynthetic Gene Clusters (BGCs) and metabolites represents a promising way of finding such novel chemistry. However, due to the lack of detailed biosynthetic knowledge for the majority of predicted BGCs, and the large number of possible combinations, this is not a simple task. This problem is becoming ever more pressing with the increased availability of paired omics data sets. Current tools are not effective at identifying valid links automatically, and manual verification is a considerable bottleneck in natural product research. We demonstrate that using multiple link-scoring functions together makes it easier to prioritise true links relative to others. Based on standardising a commonly used score, we introduce a new, more effective score, and introduce a novel score using an Input-Output Kernel Regression approach. Finally, we present NPLinker, a software framework to link genomic and metabolomic data. Results are verified using publicly available data sets that include validated links.

摘要

微生物来源的特殊代谢产物以其广泛的生物医学应用而闻名,特别是作为抗生素。在挖掘配对的基因组和代谢组数据集以寻找新的特殊代谢产物时,在生物合成基因簇 (BGC) 和代谢产物之间建立联系是一种很有前途的方法。然而,由于大多数预测 BGC 的详细生物合成知识缺乏,以及可能的组合数量众多,这并不是一项简单的任务。随着配对组学数据集的可用性增加,这个问题变得越来越紧迫。目前的工具不能有效地自动识别有效链接,而手动验证是天然产物研究的一个相当大的瓶颈。我们证明,使用多个链接评分函数一起,可以更容易地将真实链接相对于其他链接进行优先级排序。基于对常用分数的标准化,我们引入了一个新的、更有效的分数,并引入了一个使用输入-输出核回归方法的新分数。最后,我们提出了 NPLinker,这是一个用于链接基因组和代谢组数据的软件框架。使用包含验证链接的公开数据集来验证结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/acbe3fb9fe04/pcbi.1008920.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/bc2d1c18d7ad/pcbi.1008920.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/fa63e0f0b6d7/pcbi.1008920.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/da3d1be6eae5/pcbi.1008920.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/9f9bd8f319b3/pcbi.1008920.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/cfd31fbcab2d/pcbi.1008920.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/a3f67e24d6d7/pcbi.1008920.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/d77044c76d13/pcbi.1008920.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/acbe3fb9fe04/pcbi.1008920.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/bc2d1c18d7ad/pcbi.1008920.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/fa63e0f0b6d7/pcbi.1008920.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/da3d1be6eae5/pcbi.1008920.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/9f9bd8f319b3/pcbi.1008920.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/cfd31fbcab2d/pcbi.1008920.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/a3f67e24d6d7/pcbi.1008920.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/d77044c76d13/pcbi.1008920.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6567/8130963/acbe3fb9fe04/pcbi.1008920.g008.jpg

相似文献

1
Ranking microbial metabolomic and genomic links in the NPLinker framework using complementary scoring functions.在 NPLinker 框架中使用互补评分函数对微生物代谢组学和基因组学关联进行排名。
PLoS Comput Biol. 2021 May 4;17(5):e1008920. doi: 10.1371/journal.pcbi.1008920. eCollection 2021 May.
2
Enhanced correlation-based linking of biosynthetic gene clusters to their metabolic products through chemical class matching.通过化学分类匹配增强生物合成基因簇与其代谢产物的相关性链接。
Microbiome. 2023 Jan 23;11(1):13. doi: 10.1186/s40168-022-01444-3.
3
Recent development of antiSMASH and other computational approaches to mine secondary metabolite biosynthetic gene clusters.抗 SMASH 及其他计算方法在挖掘次级代谢产物生物合成基因簇方面的最新进展。
Brief Bioinform. 2019 Jul 19;20(4):1103-1113. doi: 10.1093/bib/bbx146.
4
MIBiG 2.0: a repository for biosynthetic gene clusters of known function.MIBiG 2.0:已知功能的生物合成基因簇的存储库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D454-D458. doi: 10.1093/nar/gkz882.
5
BiG-SLiCE: A highly scalable tool maps the diversity of 1.2 million biosynthetic gene clusters.BiG-SLiCE:一个高度可扩展的工具,可绘制 120 万个生物合成基因簇的多样性图谱。
Gigascience. 2021 Jan 13;10(1). doi: 10.1093/gigascience/giaa154.
6
Output ordering and prioritisation system (OOPS): ranking biosynthetic gene clusters to enhance bioactive metabolite discovery.输出排序和优先级系统(OOPS):对生物合成基因簇进行排名以增强生物活性代谢产物的发现。
J Ind Microbiol Biotechnol. 2018 Jul;45(7):615-619. doi: 10.1007/s10295-017-1993-1. Epub 2017 Dec 18.
7
Comparative Metabologenomics Analysis of Polar Actinomycetes.极地放线菌的比较代谢组学分析
Mar Drugs. 2021 Feb 10;19(2):103. doi: 10.3390/md19020103.
8
A computational framework to explore large-scale biosynthetic diversity.用于探索大规模生物合成多样性的计算框架。
Nat Chem Biol. 2020 Jan;16(1):60-68. doi: 10.1038/s41589-019-0400-9. Epub 2019 Nov 25.
9
Linking biosynthetic and chemical space to accelerate microbial secondary metabolite discovery.将生物合成和化学空间联系起来,加速微生物次生代谢产物的发现。
FEMS Microbiol Lett. 2019 Jul 1;366(13). doi: 10.1093/femsle/fnz142.
10
BGCFlow: systematic pangenome workflow for the analysis of biosynthetic gene clusters across large genomic datasets.BGCFlow:用于分析大型基因组数据集生物合成基因簇的系统泛基因组工作流程。
Nucleic Acids Res. 2024 Jun 10;52(10):5478-5495. doi: 10.1093/nar/gkae314.

引用本文的文献

1
Sequence modeling tools to decode the biosynthetic diversity of the human microbiome.用于解码人类微生物组生物合成多样性的序列建模工具。
mSystems. 2025 Jul 22;10(7):e0033325. doi: 10.1128/msystems.00333-25. Epub 2025 Jun 30.
2
Bridging Ethnobotanical Knowledge and Multi-Omics Approaches for Plant-Derived Natural Product Discovery.架起民族植物学知识与多组学方法之间的桥梁以发现植物源天然产物
Metabolites. 2025 May 29;15(6):362. doi: 10.3390/metabo15060362.
3
antiSMASH 8.0: extended gene cluster detection capabilities and analyses of chemistry, enzymology, and regulation.

本文引用的文献

1
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships.Spec2Vec:通过学习结构关系提高质谱相似性评分。
PLoS Comput Biol. 2021 Feb 16;17(2):e1008724. doi: 10.1371/journal.pcbi.1008724. eCollection 2021 Feb.
2
A community resource for paired genomic and metabolomic data mining.用于基因组和代谢组学数据挖掘的社区资源。
Nat Chem Biol. 2021 Apr;17(4):363-368. doi: 10.1038/s41589-020-00724-z.
3
Comparative Metabologenomics Analysis of Polar Actinomycetes.极地放线菌的比较代谢组学分析
抗SMASH 8.0:扩展的基因簇检测能力以及对化学、酶学和调控的分析
Nucleic Acids Res. 2025 Jul 7;53(W1):W32-W38. doi: 10.1093/nar/gkaf334.
4
Deciphering the biosynthetic potential of microbial genomes using a BGC language processing neural network model.使用生物合成基因簇语言处理神经网络模型解析微生物基因组的生物合成潜力。
Nucleic Acids Res. 2025 Apr 10;53(7). doi: 10.1093/nar/gkaf305.
5
Effective data visualization strategies in untargeted metabolomics.非靶向代谢组学中的有效数据可视化策略
Nat Prod Rep. 2024 Dec 2. doi: 10.1039/d4np00039k.
6
Pattern-Based Genome Mining Guides Discovery of the Antibiotic Indanopyrrole A from a Marine Streptomycete.基于模式的基因组挖掘指导从海洋链霉菌中发现抗生素茚并吡咯A。
J Nat Prod. 2024 Dec 27;87(12):2768-2778. doi: 10.1021/acs.jnatprod.4c00934. Epub 2024 Nov 22.
7
Pattern-based genome mining guides discovery of the antibiotic indanopyrrole A from a marine streptomycetef.基于模式的基因组挖掘指导从海洋链霉菌中发现抗生素茚并吡咯A。
bioRxiv. 2024 Oct 29:2024.10.29.620887. doi: 10.1101/2024.10.29.620887.
8
Primed for Discovery.蓄势待发,探索无限。
Biochemistry. 2024 Nov 5;63(21):2705-2713. doi: 10.1021/acs.biochem.4c00464. Epub 2024 Oct 15.
9
Discovering type I cis-AT polyketides through computational mass spectrometry and genome mining with Seq2PKS.通过计算质谱和基因组挖掘 Seq2PKS 发现 I 型顺式-AT 聚酮化合物。
Nat Commun. 2024 Jun 25;15(1):5356. doi: 10.1038/s41467-024-49587-1.
10
Metabologenomics reveals strain-level genetic and chemical diversity of secondary metabolism.代谢组学生态学揭示了次级代谢的菌株水平遗传和化学多样性。
mSystems. 2024 Jul 23;9(7):e0033424. doi: 10.1128/msystems.00334-24. Epub 2024 Jun 25.
Mar Drugs. 2021 Feb 10;19(2):103. doi: 10.3390/md19020103.
4
BiG-SLiCE: A highly scalable tool maps the diversity of 1.2 million biosynthetic gene clusters.BiG-SLiCE:一个高度可扩展的工具,可绘制 120 万个生物合成基因簇的多样性图谱。
Gigascience. 2021 Jan 13;10(1). doi: 10.1093/gigascience/giaa154.
5
Linking genomics and metabolomics to chart specialized metabolic diversity.将基因组学和代谢组学联系起来,以绘制专门代谢多样性图谱。
Chem Soc Rev. 2020 Jun 7;49(11):3297-3314. doi: 10.1039/d0cs00162g. Epub 2020 May 12.
6
Natural Products as Sources of New Drugs over the Nearly Four Decades from 01/1981 to 09/2019.天然产物:1981 年 1 月至 2019 年 9 月近四十年来的新药来源
J Nat Prod. 2020 Mar 27;83(3):770-803. doi: 10.1021/acs.jnatprod.9b01285. Epub 2020 Mar 12.
7
A computational framework to explore large-scale biosynthetic diversity.用于探索大规模生物合成多样性的计算框架。
Nat Chem Biol. 2020 Jan;16(1):60-68. doi: 10.1038/s41589-019-0400-9. Epub 2019 Nov 25.
8
MetaMiner: A Scalable Peptidogenomics Approach for Discovery of Ribosomal Peptide Natural Products with Blind Modifications from Microbial Communities.MetaMiner:一种可扩展的肽组学生物学方法,用于从微生物群落中发现具有盲修饰的核糖体肽天然产物。
Cell Syst. 2019 Dec 18;9(6):600-608.e4. doi: 10.1016/j.cels.2019.09.004. Epub 2019 Oct 16.
9
MIBiG 2.0: a repository for biosynthetic gene clusters of known function.MIBiG 2.0:已知功能的生物合成基因簇的存储库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D454-D458. doi: 10.1093/nar/gkz882.
10
A deep learning genome-mining strategy for biosynthetic gene cluster prediction.深度学习基因组挖掘策略用于生物合成基因簇预测。
Nucleic Acids Res. 2019 Oct 10;47(18):e110. doi: 10.1093/nar/gkz654.