用于非靶向代谢组学中准确代谢物注释的知识和数据驱动的双层网络

Knowledge and data-driven two-layer networking for accurate metabolite annotation in untargeted metabolomics.

作者信息

Zhang Haosong, Zeng Xinhao, Yin Yandong, Zhu Zheng-Jiang

机构信息

Interdisciplinary Research Center on Biology and Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai, China.

University of Chinese Academy of Sciences, Beijing, China.

出版信息

Nat Commun. 2025 Aug 30;16(1):8118. doi: 10.1038/s41467-025-63536-6.

DOI:10.1038/s41467-025-63536-6

PMID:40885726

Abstract

Metabolite annotation in untargeted metabolomics remains challenging due to the vast structural diversity of metabolites. Network-based approaches have emerged as powerful strategies, particularly for annotating metabolites lacking chemical standards. Here, we develop a two-layer interactive networking topology that integrates data-driven and knowledge-driven networks to enhance metabolite annotation. A comprehensive metabolic reaction network is curated using graph neural network-based prediction of reaction relationships, enhancing both coverage and network connectivity. Experimental data are pre-mapped onto this network via sequential MS1 matching, reaction relationship mapping, and MS2 similarity constraints. The generated networking topology enables interactive annotation propagation with over 10-fold improved computational efficiency. In common biological samples, it annotates over 1600 seed metabolites with chemical standards and >12,000 putatively annotated metabolites through network-based propagation. Notably, two previously uncharacterized endogenous metabolites absent from human metabolome databases have been discovered. Overall, this strategy significantly improves the coverage, accuracy, and efficiency of metabolite annotation and is freely available as MetDNA3.

摘要

由于代谢物的结构多样性极为广泛，非靶向代谢组学中的代谢物注释仍然具有挑战性。基于网络的方法已成为强大的策略，特别是对于注释缺乏化学标准品的代谢物。在此，我们开发了一种两层交互式网络拓扑结构，该结构整合了数据驱动和知识驱动的网络，以增强代谢物注释。使用基于图神经网络的反应关系预测来精心构建一个全面的代谢反应网络，从而提高覆盖率和网络连通性。通过顺序MS1匹配、反应关系映射和MS2相似性约束，将实验数据预先映射到该网络上。生成的网络拓扑结构能够进行交互式注释传播，计算效率提高了10倍以上。在常见的生物样本中，它通过基于网络的传播，为1600多种有化学标准品的种子代谢物以及超过12,000种推定注释的代谢物进行注释。值得注意的是，发现了人类代谢组数据库中不存在的两种以前未表征的内源性代谢物。总体而言，该策略显著提高了代谢物注释的覆盖率、准确性和效率，并且作为MetDNA3可免费获取。

相似文献

Knowledge and data-driven two-layer networking for accurate metabolite annotation in untargeted metabolomics.

Nat Commun. 2025 Aug 30;16(1):8118. doi: 10.1038/s41467-025-63536-6.

Olive mill solid waste induces beneficial mushroom-specialized metabolite diversity revealed by computational metabolomics strategies.

Metabolomics. 2025 Apr 26;21(3):58. doi: 10.1007/s11306-025-02257-9.

Description of metabolic differences between castrated males and intact gilts obtained from high-throughput metabolomics of porcine plasma.

J Anim Sci. 2025 Jan 4;103. doi: 10.1093/jas/skaf178.

Prescription of Controlled Substances: Benefits and Risks

Unveiling the dark matter of the metabolome: A narrative review of bioinformatics tools for LC-HRMS-based compound annotation.

Talanta. 2025 Dec 1;295:128327. doi: 10.1016/j.talanta.2025.128327. Epub 2025 May 14.

Novel Data-Driven Mechanistic Modeling of Untargeted Metabolome Data Reveals Feed Component Effects in CHO Cell Bioprocess Using Column Generation-Based EFMs.

Biotechnol J. 2025 Jul;20(7):e70008. doi: 10.1002/biot.70008.

Hyperdiverse, bioactive, and interaction-specific metabolites produced only in co-culture suggest diverse competitors may fuel secondary metabolism of xylarialean fungi.

mSystems. 2025 Jul 22;10(7):e0046825. doi: 10.1128/msystems.00468-25. Epub 2025 Jun 9.

MS2MP: A Deep Learning Framework for Metabolic Pathway Prediction from MS/MS-Based Untargeted Metabolomics.

Anal Chem. 2025 Jul 15;97(27):14200-14209. doi: 10.1021/acs.analchem.4c06875. Epub 2025 Jun 30.

Untargeted and semi-targeted metabolomics approach for profiling small intestinal and fecal metabolome using high-resolution mass spectrometry.

Metabolomics. 2025 Jun 19;21(4):84. doi: 10.1007/s11306-025-02288-2.

Combining mechanism-based prediction with patient-based profiling for psoriasis metabolomics biomarker discovery.

AMIA Annu Symp Proc. 2018 Apr 16;2017:1734-1743. eCollection 2017.

本文引用的文献

Molecular Structure Discovery for Untargeted Metabolomics Using Biotransformation Rules and Global Molecular Networking.

Anal Chem. 2025 Feb 18;97(6):3213-3219. doi: 10.1021/acs.analchem.4c01565. Epub 2025 Feb 4.

Met4DX: A Unified and Versatile Data Processing Tool for Multidimensional Untargeted Metabolomics Data.

J Am Soc Mass Spectrom. 2024 Dec 4;35(12):2960-2968. doi: 10.1021/jasms.4c00290. Epub 2024 Oct 14.

Navigating common pitfalls in metabolite identification and metabolomics bioinformatics.

Metabolomics. 2024 Sep 21;20(5):103. doi: 10.1007/s11306-024-02167-2.

Network Topology Evaluation and Transitive Alignments for Molecular Networking.

J Am Soc Mass Spectrom. 2024 Sep 4;35(9):2165-2175. doi: 10.1021/jasms.4c00208. Epub 2024 Aug 12.

AllCCS2: Curation of Ion Mobility Collision Cross-Section Atlas for Small Molecules Using Comprehensive Molecular Representations.

Anal Chem. 2023 Sep 19;95(37):13913-13921. doi: 10.1021/acs.analchem.3c02267. Epub 2023 Sep 4.

A Structure-Guided Molecular Network Strategy for Global Untargeted Metabolomics Data Annotation.

Anal Chem. 2023 Aug 8;95(31):11603-11612. doi: 10.1021/acs.analchem.3c00849. Epub 2023 Jul 26.

BUDDY: molecular formula discovery via bottom-up MS/MS interrogation.

Nat Methods. 2023 Jun;20(6):881-890. doi: 10.1038/s41592-023-01850-x. Epub 2023 Apr 13.

Integrative analysis of multimodal mass spectrometry data in MZmine 3.

Nat Biotechnol. 2023 Apr;41(4):447-449. doi: 10.1038/s41587-023-01690-2.

Metabolite annotation from knowns to unknowns through knowledge-guided multi-layer metabolic networking.

Nat Commun. 2022 Nov 4;13(1):6656. doi: 10.1038/s41467-022-34537-6.

MSNovelist: de novo structure generation from mass spectra.

Nat Methods. 2022 Jul;19(7):865-870. doi: 10.1038/s41592-022-01486-3. Epub 2022 May 30.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于非靶向代谢组学中准确代谢物注释的知识和数据驱动的双层网络

Knowledge and data-driven two-layer networking for accurate metabolite annotation in untargeted metabolomics.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献