• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

计算方法在代谢物鉴定中的最新进展和展望:重点综述机器学习方法。

Recent advances and prospects of computational methods for metabolite identification: a review with emphasis on machine learning approaches.

机构信息

Department of machine learning and bioinformatics, Bioinformatics Center, Kyoto University, Uji, Japan.

Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Japan.

出版信息

Brief Bioinform. 2019 Nov 27;20(6):2028-2043. doi: 10.1093/bib/bby066.

DOI:10.1093/bib/bby066
PMID:30099485
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6954430/
Abstract

Metabolomics involves studies of a great number of metabolites, which are small molecules present in biological systems. They play a lot of important functions such as energy transport, signaling, building block of cells and inhibition/catalysis. Understanding biochemical characteristics of the metabolites is an essential and significant part of metabolomics to enlarge the knowledge of biological systems. It is also the key to the development of many applications and areas such as biotechnology, biomedicine or pharmaceuticals. However, the identification of the metabolites remains a challenging task in metabolomics with a huge number of potentially interesting but unknown metabolites. The standard method for identifying metabolites is based on the mass spectrometry (MS) preceded by a separation technique. Over many decades, many techniques with different approaches have been proposed for MS-based metabolite identification task, which can be divided into the following four groups: mass spectra database, in silico fragmentation, fragmentation tree and machine learning. In this review paper, we thoroughly survey currently available tools for metabolite identification with the focus on in silico fragmentation, and machine learning-based approaches. We also give an intensive discussion on advanced machine learning methods, which can lead to further improvement on this task.

摘要

代谢组学涉及大量代谢物的研究,这些代谢物是存在于生物系统中的小分子。它们具有许多重要的功能,如能量运输、信号传递、细胞构建块和抑制/催化。了解代谢物的生化特性是代谢组学的一个重要组成部分,它可以扩大对生物系统的认识。它也是生物技术、生物医学或制药等许多应用和领域发展的关键。然而,代谢物的鉴定仍然是代谢组学中的一个具有挑战性的任务,因为有大量潜在的但未知的代谢物。鉴定代谢物的标准方法是基于质谱 (MS) ,之前是分离技术。在过去的几十年里,已经提出了许多具有不同方法的技术来进行基于 MS 的代谢物鉴定任务,这些技术可以分为以下四类:质谱数据库、计算机模拟碎片、碎片树和机器学习。在这篇综述文章中,我们全面调查了目前可用于代谢物鉴定的工具,重点是计算机模拟碎片和基于机器学习的方法。我们还对先进的机器学习方法进行了深入讨论,这些方法可以进一步提高这个任务的效果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/1c85b7daf564/bby066f15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/a16bbb0ef9e9/bby066f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/0e8ca3767266/bby066f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/1eaa5d4cc17a/bby066f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/f892bb018829/bby066f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/0cb4e87ee9a2/bby066f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/049c04eaedca/bby066f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/cd9f15e316c4/bby066f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/5bdd7cfa1546/bby066f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/60b0b6bf0f5f/bby066f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/4b125cd6e752/bby066f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/bc1c932df2b5/bby066f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/e8c7fd682479/bby066f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/ff6864213aee/bby066f13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/d02b4e98d22b/bby066f14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/1c85b7daf564/bby066f15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/a16bbb0ef9e9/bby066f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/0e8ca3767266/bby066f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/1eaa5d4cc17a/bby066f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/f892bb018829/bby066f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/0cb4e87ee9a2/bby066f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/049c04eaedca/bby066f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/cd9f15e316c4/bby066f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/5bdd7cfa1546/bby066f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/60b0b6bf0f5f/bby066f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/4b125cd6e752/bby066f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/bc1c932df2b5/bby066f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/e8c7fd682479/bby066f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/ff6864213aee/bby066f13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/d02b4e98d22b/bby066f14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5373/6954430/1c85b7daf564/bby066f15.jpg

相似文献

1
Recent advances and prospects of computational methods for metabolite identification: a review with emphasis on machine learning approaches.计算方法在代谢物鉴定中的最新进展和展望:重点综述机器学习方法。
Brief Bioinform. 2019 Nov 27;20(6):2028-2043. doi: 10.1093/bib/bby066.
2
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
3
Good practices and recommendations for using and benchmarking computational metabolomics metabolite annotation tools.使用和基准测试计算代谢组学生物标志物注释工具的良好实践和建议。
Metabolomics. 2022 Dec 5;18(12):103. doi: 10.1007/s11306-022-01963-y.
4
Automated pathway and reaction prediction facilitates in silico identification of unknown metabolites in human cohort studies.自动化途径和反应预测有助于在人类队列研究中通过计算机模拟鉴定未知代谢物。
J Chromatogr B Analyt Technol Biomed Life Sci. 2017 Dec 15;1071:58-67. doi: 10.1016/j.jchromb.2017.04.002. Epub 2017 Apr 4.
5
Searching molecular structure databases with tandem mass spectra using CSI:FingerID.使用CSI:FingerID通过串联质谱搜索分子结构数据库。
Proc Natl Acad Sci U S A. 2015 Oct 13;112(41):12580-5. doi: 10.1073/pnas.1509788112. Epub 2015 Sep 21.
6
Overview of Tandem Mass Spectral and Metabolite Databases for Metabolite Identification in Metabolomics.串联质谱和代谢物数据库概述,用于代谢组学中的代谢物鉴定。
Methods Mol Biol. 2020;2104:139-148. doi: 10.1007/978-1-0716-0239-3_8.
7
A map of mass spectrometry-based in silico fragmentation prediction and compound identification in metabolomics.基于质谱的代谢组学中计算机模拟碎裂预测和化合物鉴定的图谱。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab073.
8
Metabolite identification through multiple kernel learning on fragmentation trees.基于碎裂树的多核学习进行代谢产物鉴定。
Bioinformatics. 2014 Jun 15;30(12):i157-64. doi: 10.1093/bioinformatics/btu275.
9
Deciphering complex metabolite mixtures by unsupervised and supervised substructure discovery and semi-automated annotation from MS/MS spectra.通过无监督和有监督的子结构发现以及从 MS/MS 光谱进行半自动注释来破译复杂代谢物混合物。
Faraday Discuss. 2019 Aug 15;218(0):284-302. doi: 10.1039/c8fd00235e.
10
Recent advances in mass spectrometry-based computational metabolomics.基于质谱的计算代谢组学的最新进展。
Curr Opin Chem Biol. 2023 Jun;74:102288. doi: 10.1016/j.cbpa.2023.102288. Epub 2023 Mar 24.

引用本文的文献

1
An approach of molecular-fingerprint prediction implementing a GAT.一种实现图注意力网络(GAT)的分子指纹预测方法。
RSC Adv. 2025 Apr 22;15(16):12757-12764. doi: 10.1039/d5ra00973a. eCollection 2025 Apr 16.
2
Prediction of Fragmentation Pathway of Natural Products, Antibiotics, and Pesticides by ChemFrag.通过ChemFrag预测天然产物、抗生素和农药的碎片化途径
J Mass Spectrom. 2025 May;60(5):e5129. doi: 10.1002/jms.5129.
3
Machine Learning-based Classification for the Prioritization of Potentially Hazardous Chemicals with Structural Alerts in Nontarget Screening.

本文引用的文献

1
MESSAR: Automated recommendation of metabolite substructures from tandem mass spectra.MESSAR:串联质谱中代谢物亚结构的自动推荐。
PLoS One. 2020 Jan 16;15(1):e0226770. doi: 10.1371/journal.pone.0226770. eCollection 2020.
2
SIMPLE: Sparse Interaction Model over Peaks of moLEcules for fast, interpretable metabolite identification from tandem mass spectra.SIMPLE:基于分子峰的稀疏相互作用模型,用于从串联质谱中快速、可解释地鉴定代谢物。
Bioinformatics. 2018 Jul 1;34(13):i323-i332. doi: 10.1093/bioinformatics/bty252.
3
HMDB 4.0: the human metabolome database for 2018.
基于机器学习的非靶向筛查中具有结构警示的潜在危险化学品优先级分类
Environ Sci Technol. 2025 Mar 18;59(10):5056-5065. doi: 10.1021/acs.est.4c10498. Epub 2025 Mar 7.
4
Current approaches and outstanding challenges of functional annotation of metabolites: a comprehensive review.当前代谢物功能注释的方法和突出挑战:全面综述。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae498.
5
Machine learning for metabolic pathway optimization: A review.用于代谢途径优化的机器学习:综述
Comput Struct Biotechnol J. 2023 Mar 27;21:2381-2393. doi: 10.1016/j.csbj.2023.03.045. eCollection 2023.
6
Chemprop: A Machine Learning Package for Chemical Property Prediction.Chemprop:一个用于化学性质预测的机器学习工具包。
J Chem Inf Model. 2024 Jan 8;64(1):9-17. doi: 10.1021/acs.jcim.3c01250. Epub 2023 Dec 26.
7
Data-Driven Compound Identification in Atmospheric Mass Spectrometry.大气质谱中数据驱动的化合物识别
Adv Sci (Weinh). 2024 Feb;11(8):e2306235. doi: 10.1002/advs.202306235. Epub 2023 Dec 14.
8
Microbial Metabolites Annotation by Mass Spectrometry-Based Metabolomics.基于质谱代谢组学的微生物代谢产物注释。
Adv Exp Med Biol. 2023;1439:225-248. doi: 10.1007/978-3-031-41741-2_9.
9
Exploring chemical space for lead identification by propagating on chemical similarity network.通过在化学相似性网络上进行传播来探索用于先导物识别的化学空间。
Comput Struct Biotechnol J. 2023 Aug 25;21:4187-4195. doi: 10.1016/j.csbj.2023.08.016. eCollection 2023.
10
An end-to-end deep learning framework for translating mass spectra to de-novo molecules.一种用于将质谱图翻译为从头合成分子的端到端深度学习框架。
Commun Chem. 2023 Jun 23;6(1):132. doi: 10.1038/s42004-023-00932-3.
HMDB 4.0:2018 年人类代谢组数据库。
Nucleic Acids Res. 2018 Jan 4;46(D1):D608-D617. doi: 10.1093/nar/gkx1089.
4
Topic modeling for untargeted substructure exploration in metabolomics.代谢组学中用于非靶向子结构探索的主题建模
Proc Natl Acad Sci U S A. 2016 Nov 29;113(48):13738-13743. doi: 10.1073/pnas.1608041113. Epub 2016 Nov 16.
5
Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking.通过全球天然产物社会分子网络共享和社区管理质谱数据。
Nat Biotechnol. 2016 Aug 9;34(8):828-837. doi: 10.1038/nbt.3597.
6
Fast metabolite identification with Input Output Kernel Regression.使用输入输出核回归进行快速代谢物鉴定。
Bioinformatics. 2016 Jun 15;32(12):i28-i36. doi: 10.1093/bioinformatics/btw246.
7
Learning interactions via hierarchical group-lasso regularization.通过分层组套索正则化学习交互作用。
J Comput Graph Stat. 2015;24(3):627-654. doi: 10.1080/10618600.2014.938812. Epub 2015 Sep 16.
8
Searching molecular structure databases with tandem mass spectra using CSI:FingerID.使用CSI:FingerID通过串联质谱搜索分子结构数据库。
Proc Natl Acad Sci U S A. 2015 Oct 13;112(41):12580-5. doi: 10.1073/pnas.1509788112. Epub 2015 Sep 21.
9
A LASSO FOR HIERARCHICAL INTERACTIONS.用于分层交互的套索法
Ann Stat. 2013 Jun;41(3):1111-1141. doi: 10.1214/13-AOS1096.
10
Using fragmentation trees and mass spectral trees for identifying unknown compounds in metabolomics.利用碎裂树和质谱树鉴定代谢组学中的未知化合物。
Trends Analyt Chem. 2015 Jun 1;69:52-61. doi: 10.1016/j.trac.2015.04.002.