• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MS2Query:可靠且可扩展的基于 MS 质谱的模拟搜索。

MS2Query: reliable and scalable MS mass spectra-based analogue search.

机构信息

Bioinformatics Group, Wageningen University & Research, 6708 PB, Wageningen, the Netherlands.

National Phenome Centre, Section of Bioanalytical Chemistry, Division of Systems Medicine, Department of Metabolism, Digestion and Reproduction, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, W12 0NN, UK.

出版信息

Nat Commun. 2023 Mar 29;14(1):1752. doi: 10.1038/s41467-023-37446-4.

DOI:10.1038/s41467-023-37446-4
PMID:36990978
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10060387/
Abstract

Metabolomics-driven discoveries of biological samples remain hampered by the grand challenge of metabolite annotation and identification. Only few metabolites have an annotated spectrum in spectral libraries; hence, searching only for exact library matches generally returns a few hits. An attractive alternative is searching for so-called analogues as a starting point for structural annotations; analogues are library molecules which are not exact matches but display a high chemical similarity. However, current analogue search implementations are not yet very reliable and relatively slow. Here, we present MS2Query, a machine learning-based tool that integrates mass spectral embedding-based chemical similarity predictors (Spec2Vec and MS2Deepscore) as well as detected precursor masses to rank potential analogues and exact matches. Benchmarking MS2Query on reference mass spectra and experimental case studies demonstrate improved reliability and scalability. Thereby, MS2Query offers exciting opportunities to further increase the annotation rate of metabolomics profiles of complex metabolite mixtures and to discover new biology.

摘要

代谢组学驱动的生物样本发现仍然受到代谢物注释和鉴定这一重大挑战的阻碍。只有少数代谢物在光谱库中有被注释的光谱;因此,仅搜索精确的库匹配通常只能得到少数命中。一个有吸引力的替代方法是寻找所谓的类似物作为结构注释的起点;类似物是库分子,它们不是精确匹配,但显示出高度的化学相似性。然而,目前的类似物搜索实现还不是非常可靠和相对较慢。在这里,我们介绍了 MS2Query,这是一个基于机器学习的工具,它集成了基于质荷比嵌入的化学相似性预测器(Spec2Vec 和 MS2Deepscore)以及检测到的前体质量,以对潜在的类似物和精确匹配进行排名。在参考质谱和实验案例研究上对 MS2Query 进行基准测试,证明了其可靠性和可扩展性的提高。因此,MS2Query 为进一步提高复杂代谢物混合物代谢组学图谱的注释率和发现新生物学提供了令人兴奋的机会。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655f/10060387/9e99755126df/41467_2023_37446_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655f/10060387/13696a223f48/41467_2023_37446_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655f/10060387/f5a4a2cddab1/41467_2023_37446_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655f/10060387/b7450c5a068c/41467_2023_37446_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655f/10060387/dd96c5ad56f6/41467_2023_37446_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655f/10060387/9e99755126df/41467_2023_37446_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655f/10060387/13696a223f48/41467_2023_37446_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655f/10060387/f5a4a2cddab1/41467_2023_37446_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655f/10060387/b7450c5a068c/41467_2023_37446_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655f/10060387/dd96c5ad56f6/41467_2023_37446_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/655f/10060387/9e99755126df/41467_2023_37446_Fig5_HTML.jpg

相似文献

1
MS2Query: reliable and scalable MS mass spectra-based analogue search.MS2Query:可靠且可扩展的基于 MS 质谱的模拟搜索。
Nat Commun. 2023 Mar 29;14(1):1752. doi: 10.1038/s41467-023-37446-4.
2
Good practices and recommendations for using and benchmarking computational metabolomics metabolite annotation tools.使用和基准测试计算代谢组学生物标志物注释工具的良好实践和建议。
Metabolomics. 2022 Dec 5;18(12):103. doi: 10.1007/s11306-022-01963-y.
3
Spec2Vec: Improved mass spectral similarity scoring through learning of structural relationships.Spec2Vec:通过学习结构关系提高质谱相似性评分。
PLoS Comput Biol. 2021 Feb 16;17(2):e1008724. doi: 10.1371/journal.pcbi.1008724. eCollection 2021 Feb.
4
MS2DeepScore: a novel deep learning similarity measure to compare tandem mass spectra.MS2DeepScore:一种用于比较串联质谱的新型深度学习相似性度量方法。
J Cheminform. 2021 Oct 29;13(1):84. doi: 10.1186/s13321-021-00558-4.
5
Deciphering complex metabolite mixtures by unsupervised and supervised substructure discovery and semi-automated annotation from MS/MS spectra.通过无监督和有监督的子结构发现以及从 MS/MS 光谱进行半自动注释来破译复杂代谢物混合物。
Faraday Discuss. 2019 Aug 15;218(0):284-302. doi: 10.1039/c8fd00235e.
6
compMS2Miner: An Automatable Metabolite Identification, Visualization, and Data-Sharing R Package for High-Resolution LC-MS Data Sets.compMS2Miner:一个用于高分辨 LC-MS 数据集的自动化代谢物鉴定、可视化和数据共享 R 包。
Anal Chem. 2017 Apr 4;89(7):3919-3928. doi: 10.1021/acs.analchem.6b02394. Epub 2017 Mar 27.
7
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
8
Deep Learning Driven GC-MS Library Search and Its Application for Metabolomics.深度学习驱动的气相色谱-质谱联用库检索及其在代谢组学中的应用
Anal Chem. 2020 Sep 1;92(17):11818-11825. doi: 10.1021/acs.analchem.0c02082. Epub 2020 Aug 12.
9
[A novel method for efficient screening and annotation of important pathway-associated metabolites based on the modified metabolome and probe molecules].一种基于改良代谢组和探针分子的重要通路相关代谢物高效筛选与注释新方法
Se Pu. 2022 Sep;40(9):788-796. doi: 10.3724/SP.J.1123.2022.03025.
10
Metabolite identification using automated comparison of high-resolution multistage mass spectral trees.使用高分辨多级质谱树的自动比较进行代谢产物鉴定。
Anal Chem. 2012 Jul 3;84(13):5524-34. doi: 10.1021/ac2034216. Epub 2012 Jun 22.

引用本文的文献

1
Multilaboratory Untargeted Mass Spectrometry Metabolomics Collaboration to Identify Bottlenecks and Comprehensively Annotate A Single Dataset.多实验室非靶向质谱代谢组学协作以识别瓶颈并全面注释单个数据集
Anal Chem. 2025 Aug 5;97(30):16110-16122. doi: 10.1021/acs.analchem.4c05577. Epub 2025 Jul 22.
2
Bridging Ethnobotanical Knowledge and Multi-Omics Approaches for Plant-Derived Natural Product Discovery.架起民族植物学知识与多组学方法之间的桥梁以发现植物源天然产物
Metabolites. 2025 May 29;15(6):362. doi: 10.3390/metabo15060362.
3
Microplastics in the Soil at Sub-Toxic Concentrations Cause Metabolic Changes Decreasing Fungal Pathogen Susceptibility in Arabidopsis thaliana.

本文引用的文献

1
An Ensemble Spectral Prediction (ESP) model for metabolite annotation.用于代谢物注释的集成谱预测 (ESP) 模型。
Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae490.
2
Open access repository-scale propagated nearest neighbor suspect spectral library for untargeted metabolomics.开放获取存储库规模传播的近邻可疑光谱库,用于无目标代谢组学。
Nat Commun. 2023 Dec 20;14(1):8488. doi: 10.1038/s41467-023-44035-y.
3
Good practices and recommendations for using and benchmarking computational metabolomics metabolite annotation tools.
土壤中亚毒性浓度的微塑料会导致代谢变化,降低拟南芥对真菌病原体的易感性。
Physiol Plant. 2025 May-Jun;177(3):e70312. doi: 10.1111/ppl.70312.
4
Olive mill solid waste induces beneficial mushroom-specialized metabolite diversity revealed by computational metabolomics strategies.油橄榄榨油厂固体废物诱导产生有益的蘑菇特异性代谢物多样性,这是通过计算代谢组学策略揭示的。
Metabolomics. 2025 Apr 26;21(3):58. doi: 10.1007/s11306-025-02257-9.
5
Mapping growth differentiation factor-15 (GDF15)-mediated signaling pathways in cancer: insights into its role across different cancer types.绘制癌症中生长分化因子15(GDF15)介导的信号通路:洞察其在不同癌症类型中的作用
Discov Oncol. 2025 Mar 25;16(1):386. doi: 10.1007/s12672-025-02121-1.
6
Synthetic Biology in Natural Product Biosynthesis.天然产物生物合成中的合成生物学
Chem Rev. 2025 Apr 9;125(7):3814-3931. doi: 10.1021/acs.chemrev.4c00567. Epub 2025 Mar 21.
7
Discovering organic reactions with a machine-learning-powered deciphering of tera-scale mass spectrometry data.通过机器学习驱动的太赫兹级质谱数据解析发现有机反应。
Nat Commun. 2025 Mar 16;16(1):2587. doi: 10.1038/s41467-025-56905-8.
8
Machine Learning-based Classification for the Prioritization of Potentially Hazardous Chemicals with Structural Alerts in Nontarget Screening.基于机器学习的非靶向筛查中具有结构警示的潜在危险化学品优先级分类
Environ Sci Technol. 2025 Mar 18;59(10):5056-5065. doi: 10.1021/acs.est.4c10498. Epub 2025 Mar 7.
9
Molecular Structure Discovery for Untargeted Metabolomics Using Biotransformation Rules and Global Molecular Networking.利用生物转化规则和全局分子网络进行非靶向代谢组学的分子结构发现
Anal Chem. 2025 Feb 18;97(6):3213-3219. doi: 10.1021/acs.analchem.4c01565. Epub 2025 Feb 4.
10
Multidrug efflux pumps of show selectivity for their natural substrates.多药外排泵对其天然底物具有选择性。
Front Microbiol. 2025 Jan 9;15:1512472. doi: 10.3389/fmicb.2024.1512472. eCollection 2024.
使用和基准测试计算代谢组学生物标志物注释工具的良好实践和建议。
Metabolomics. 2022 Dec 5;18(12):103. doi: 10.1007/s11306-022-01963-y.
4
Comparison of Cosine, Modified Cosine, and Neutral Loss Based Spectrum Alignment For Discovery of Structurally Related Molecules.余弦、修正余弦和中性丢失的谱对齐方法比较在结构相关分子发现中的应用。
J Am Soc Mass Spectrom. 2022 Sep 7;33(9):1733-1744. doi: 10.1021/jasms.2c00153. Epub 2022 Aug 12.
5
MSNovelist: de novo structure generation from mass spectra.MSNovelist:从头开始从质谱生成结构。
Nat Methods. 2022 Jul;19(7):865-870. doi: 10.1038/s41592-022-01486-3. Epub 2022 May 30.
6
MassGenie: A Transformer-Based Deep Learning Method for Identifying Small Molecules from Their Mass Spectra.MassGenie:一种基于 Transformer 的深度学习方法,用于从其质谱中识别小分子。
Biomolecules. 2021 Nov 30;11(12):1793. doi: 10.3390/biom11121793.
7
Advances in decomposing complex metabolite mixtures using substructure- and network-based computational metabolomics approaches.利用基于子结构和网络的计算代谢组学方法分解复杂代谢物混合物的进展。
Nat Prod Rep. 2021 Nov 17;38(11):1967-1993. doi: 10.1039/d1np00023c.
8
MS2DeepScore: a novel deep learning similarity measure to compare tandem mass spectra.MS2DeepScore:一种用于比较串联质谱的新型深度学习相似性度量方法。
J Cheminform. 2021 Oct 29;13(1):84. doi: 10.1186/s13321-021-00558-4.
9
Advancements in capturing and mining mass spectrometry data are transforming natural products research.质谱数据采集和挖掘技术的进步正在改变天然产物研究。
Nat Prod Rep. 2021 Nov 17;38(11):2066-2082. doi: 10.1039/d1np00040c.
10
Advanced tandem mass spectrometry in metabolomics and lipidomics-methods and applications.代谢组学和脂质组学中的高级串联质谱法——方法与应用。
Anal Bioanal Chem. 2021 Oct;413(24):5927-5948. doi: 10.1007/s00216-021-03425-1. Epub 2021 Jun 18.