• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

NPOmix:一种将质谱碎裂数据与生物合成基因簇相联系的机器学习分类器。

NPOmix: A machine learning classifier to connect mass spectrometry fragmentation data to biosynthetic gene clusters.

作者信息

Leão Tiago F, Wang Mingxun, da Silva Ricardo, Gurevich Alexey, Bauermeister Anelize, Gomes Paulo Wender P, Brejnrod Asker, Glukhov Evgenia, Aron Allegra T, Louwen Joris J R, Kim Hyun Woo, Reher Raphael, Fiore Marli F, van der Hooft Justin J J, Gerwick Lena, Gerwick William H, Bandeira Nuno, Dorrestein Pieter C

机构信息

Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA.

Center for Nuclear Energy in Agriculture, University of São Paulo, Piracicaba 13400-970, SP, Brazil.

出版信息

PNAS Nexus. 2022 Nov 16;1(5):pgac257. doi: 10.1093/pnasnexus/pgac257. eCollection 2022 Nov.

DOI:10.1093/pnasnexus/pgac257
PMID:36712343
原文链接:
https://pmc.ncbi.nlm.nih.gov/articles/PMC9802219/
Abstract

Microbial specialized metabolites are an important source of and inspiration for many pharmaceuticals, biotechnological products and play key roles in ecological processes. Untargeted metabolomics using liquid chromatography coupled with tandem mass spectrometry is an efficient technique to access metabolites from fractions and even environmental crude extracts. Nevertheless, metabolomics is limited in predicting structures or bioactivities for cryptic metabolites. Efficiently linking the biosynthetic potential inferred from (meta)genomics to the specialized metabolome would accelerate drug discovery programs by allowing metabolomics to make use of genetic predictions. Here, we present a -nearest neighbor classifier to systematically connect mass spectrometry fragmentation spectra to their corresponding biosynthetic gene clusters (independent of their chemical class). Our new pattern-based genome mining pipeline links biosynthetic genes to metabolites that they encode for, as detected via mass spectrometry from bacterial cultures or environmental microbiomes. Using paired datasets that include validated genes-mass spectral links from the Paired Omics Data Platform, we demonstrate this approach by automatically linking 18 previously known mass spectra (17 for which the biosynthesis gene clusters can be found at the MIBiG database plus palmyramide A) to their corresponding previously experimentally validated biosynthetic genes (e.g., via nuclear magnetic resonance or genetic engineering). We illustrated a computational example of how to use our Natural Products Mixed Omics (NPOmix) tool for siderophore mining that can be reproduced by the users. We conclude that NPOmix minimizes the need for culturing (it worked well on microbiomes) and facilitates specialized metabolite prioritization based on integrative omics mining.

摘要

微生物特殊代谢产物是许多药物和生物技术产品的重要来源及灵感源泉,并且在生态过程中发挥关键作用。使用液相色谱联用串联质谱的非靶向代谢组学是一种从馏分甚至环境粗提物中获取代谢产物的有效技术。然而,代谢组学在预测隐秘代谢产物的结构或生物活性方面存在局限性。通过使代谢组学能够利用基因预测,将从(宏)基因组学推断出的生物合成潜力与特殊代谢组有效联系起来,将加速药物发现计划。在此,我们提出一种最近邻分类器,以系统地将质谱碎裂谱与其相应的生物合成基因簇(与其化学类别无关)相连接。我们基于新模式的基因组挖掘流程将生物合成基因与其编码的代谢产物相联系,这些代谢产物是通过对细菌培养物或环境微生物群落进行质谱检测得到的。利用包括来自配对组学数据平台的经过验证的基因 - 质谱链接的配对数据集,我们通过自动将18个先前已知的质谱(其中17个在MIBiG数据库中可找到其生物合成基因簇,加上棕榈酰胺A)与其相应的先前经过实验验证的生物合成基因(例如,通过核磁共振或基因工程)相连接,展示了这种方法。我们举例说明了如何使用我们的天然产物混合组学(NPOmix)工具进行铁载体挖掘的计算示例,用户可以重现该示例。我们得出结论,NPOmix将培养需求降至最低(在微生物群落上效果良好),并基于综合组学挖掘促进特殊代谢产物的优先级排序。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0531/9802219/7d9c07d14021/pgac257fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0531/9802219/657a0aeaa941/pgac257fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0531/9802219/b97c4f2c387e/pgac257fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0531/9802219/f08a403398e2/pgac257fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0531/9802219/7d9c07d14021/pgac257fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0531/9802219/657a0aeaa941/pgac257fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0531/9802219/b97c4f2c387e/pgac257fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0531/9802219/f08a403398e2/pgac257fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0531/9802219/7d9c07d14021/pgac257fig4.jpg

相似文献

1
NPOmix: A machine learning classifier to connect mass spectrometry fragmentation data to biosynthetic gene clusters.NPOmix:一种将质谱碎裂数据与生物合成基因簇相联系的机器学习分类器。
PNAS Nexus. 2022 Nov 16;1(5):pgac257. doi: 10.1093/pnasnexus/pgac257. eCollection 2022 Nov.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Short-Term Memory Impairment短期记忆障碍
4
Plug-and-play use of tree-based methods: consequences for clinical prediction modeling.基于树的方法的即插即用:对临床预测模型的影响。
J Clin Epidemiol. 2025 Aug;184:111834. doi: 10.1016/j.jclinepi.2025.111834. Epub 2025 May 19.
5
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验:定性证据综合。
Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.
6
Novel Gene Clusters for Natural Product Synthesis Are Abundant in the Mangrove Swamp Microbiome.新型天然产物合成基因簇在红树林沼泽微生物组中大量存在。
Appl Environ Microbiol. 2023 Jun 28;89(6):e0010223. doi: 10.1128/aem.00102-23. Epub 2023 May 16.
7
Sexual Harassment and Prevention Training性骚扰与预防培训
8
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
9
Healthcare workers' informal uses of mobile phones and other mobile devices to support their work: a qualitative evidence synthesis.医护人员非正规使用手机和其他移动设备来支持工作:定性证据综合评价。
Cochrane Database Syst Rev. 2024 Aug 27;8(8):CD015705. doi: 10.1002/14651858.CD015705.pub2.
10
Autistic Students' Experiences of Employment and Employability Support while Studying at a UK University.自闭症学生在英国大学学习期间的就业经历及就业支持情况
Autism Adulthood. 2025 Apr 3;7(2):212-222. doi: 10.1089/aut.2024.0112. eCollection 2025 Apr.

引用本文的文献

1
Credible inferences in microbiome research: ensuring rigour, reproducibility and relevance in the era of AI.微生物组研究中的可靠推断:在人工智能时代确保严谨性、可重复性和相关性
Nat Rev Gastroenterol Hepatol. 2025 Jul 31. doi: 10.1038/s41575-025-01100-9.
2
Sequence modeling tools to decode the biosynthetic diversity of the human microbiome.用于解码人类微生物组生物合成多样性的序列建模工具。
mSystems. 2025 Jul 22;10(7):e0033325. doi: 10.1128/msystems.00333-25. Epub 2025 Jun 30.
3
A universal language for finding mass spectrometry data patterns.

本文引用的文献

1
iPRESTO: Automated discovery of biosynthetic sub-clusters linked to specific natural product substructures.iPRESTO:与特定天然产物亚结构相关的生物合成亚簇的自动发现。
PLoS Comput Biol. 2023 Feb 9;19(2):e1010462. doi: 10.1371/journal.pcbi.1010462. eCollection 2023 Feb.
2
Enhanced correlation-based linking of biosynthetic gene clusters to their metabolic products through chemical class matching.通过化学分类匹配增强生物合成基因簇与其代谢产物的相关性链接。
Microbiome. 2023 Jan 23;11(1):13. doi: 10.1186/s40168-022-01444-3.
3
MS2DeepScore: a novel deep learning similarity measure to compare tandem mass spectra.
一种用于查找质谱数据模式的通用语言。
Nat Methods. 2025 May 12. doi: 10.1038/s41592-025-02660-z.
4
Pattern-Based Genome Mining Guides Discovery of the Antibiotic Indanopyrrole A from a Marine Streptomycete.基于模式的基因组挖掘指导从海洋链霉菌中发现抗生素茚并吡咯A。
J Nat Prod. 2024 Dec 27;87(12):2768-2778. doi: 10.1021/acs.jnatprod.4c00934. Epub 2024 Nov 22.
5
Pattern-based genome mining guides discovery of the antibiotic indanopyrrole A from a marine streptomycetef.基于模式的基因组挖掘指导从海洋链霉菌中发现抗生素茚并吡咯A。
bioRxiv. 2024 Oct 29:2024.10.29.620887. doi: 10.1101/2024.10.29.620887.
6
Discovering type I cis-AT polyketides through computational mass spectrometry and genome mining with Seq2PKS.通过计算质谱和基因组挖掘 Seq2PKS 发现 I 型顺式-AT 聚酮化合物。
Nat Commun. 2024 Jun 25;15(1):5356. doi: 10.1038/s41467-024-49587-1.
7
Progress and challenges in exploring aquatic microbial communities using non-targeted metabolomics.利用非靶向代谢组学探索水生微生物群落的进展与挑战。
ISME J. 2023 Dec;17(12):2147-2159. doi: 10.1038/s41396-023-01532-8. Epub 2023 Oct 19.
8
Metabologenomics analysis of sp. So3.2b, an Antarctic strain with bioactivity against .对南极菌株So3.2b进行代谢物基因组学分析,该菌株具有针对……的生物活性。 (原文中“against.”后面内容不完整)
Front Microbiol. 2023 May 4;14:1187321. doi: 10.3389/fmicb.2023.1187321. eCollection 2023.
9
Enhanced correlation-based linking of biosynthetic gene clusters to their metabolic products through chemical class matching.通过化学分类匹配增强生物合成基因簇与其代谢产物的相关性链接。
Microbiome. 2023 Jan 23;11(1):13. doi: 10.1186/s40168-022-01444-3.
MS2DeepScore:一种用于比较串联质谱的新型深度学习相似性度量方法。
J Cheminform. 2021 Oct 29;13(1):84. doi: 10.1186/s13321-021-00558-4.
4
Nerpa: A Tool for Discovering Biosynthetic Gene Clusters of Bacterial Nonribosomal Peptides.Nerpa:一种用于发现细菌非核糖体肽生物合成基因簇的工具。
Metabolites. 2021 Oct 11;11(10):693. doi: 10.3390/metabo11100693.
5
NPClassifier: A Deep Neural Network-Based Structural Classification Tool for Natural Products.NPClassifier:一种基于深度神经网络的天然产物结构分类工具。
J Nat Prod. 2021 Nov 26;84(11):2795-2807. doi: 10.1021/acs.jnatprod.1c00399. Epub 2021 Oct 18.
6
Integrating genomics and metabolomics for scalable non-ribosomal peptide discovery.整合基因组学和代谢组学以实现可扩展的非核糖体肽发现。
Nat Commun. 2021 May 28;12(1):3225. doi: 10.1038/s41467-021-23502-4.
7
A Machine Learning Bioinformatics Method to Predict Biological Activity from Biosynthetic Gene Clusters.一种基于机器学习的生物信息学方法,可从生物合成基因簇中预测生物活性。
J Chem Inf Model. 2021 Jun 28;61(6):2560-2571. doi: 10.1021/acs.jcim.0c01304. Epub 2021 May 27.
8
Ranking microbial metabolomic and genomic links in the NPLinker framework using complementary scoring functions.在 NPLinker 框架中使用互补评分函数对微生物代谢组学和基因组学关联进行排名。
PLoS Comput Biol. 2021 May 4;17(5):e1008920. doi: 10.1371/journal.pcbi.1008920. eCollection 2021 May.
9
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships.Spec2Vec:通过学习结构关系提高质谱相似性评分。
PLoS Comput Biol. 2021 Feb 16;17(2):e1008724. doi: 10.1371/journal.pcbi.1008724. eCollection 2021 Feb.
10
A community resource for paired genomic and metabolomic data mining.用于基因组和代谢组学数据挖掘的社区资源。
Nat Chem Biol. 2021 Apr;17(4):363-368. doi: 10.1038/s41589-020-00724-z.