• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度学习的代谢物注释分子指纹预测

Deep Learning-Based Molecular Fingerprint Prediction for Metabolite Annotation.

作者信息

Chau Hoi Yan Katharine, Zhang Xinran, Ressom Habtom W

机构信息

Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University Medical Center, Washington, DC 20057, USA.

出版信息

Metabolites. 2025 Feb 14;15(2):132. doi: 10.3390/metabo15020132.

DOI:10.3390/metabo15020132
PMID:39997757
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11857613/
Abstract

Liquid chromatography coupled with mass spectrometry (LC-MS) is a commonly used platform for many metabolomics studies. However, metabolite annotation has been a major bottleneck in these studies in part due to the limited publicly available spectral libraries, which consist of tandem mass spectrometry (MS/MS) data acquired from just a fraction of known compounds. Application of deep learning methods is increasingly reported as an alternative to spectral matching due to their ability to map complex relationships between molecular fingerprints and mass spectrometric measurements. The objectives of this study are to investigate deep learning methods for molecular fingerprint based on MS/MS spectra and to rank putative metabolite IDs according to similarity of their known and predicted molecular fingerprints. : We trained three types of deep learning methods to model the relationships between molecular fingerprints and MS/MS spectra. Prior to training, various data processing steps, including scaling, binning, and filtering, were performed on MS/MS spectra obtained from National Institute of Standards and Technology (NIST), MassBank of North America (MoNA), and Human Metabolome Database (HMDB). Furthermore, selection of the most relevant / bins and molecular fingerprints was conducted. The trained deep learning models were evaluated on ranking putative metabolite IDs obtained from a compound database for the challenges in Critical Assessment of Small Molecule Identification (CASMI) 2016, CASMI 2017, and CASMI 2022 benchmark datasets. : Feature selection methods effectively reduced redundant molecular and spectral features prior to model training. Deep learning methods trained with the truncated features have shown comparable performances against CSI:FingerID on ranking putative metabolite IDs. : The results demonstrate a promising potential of deep learning methods for metabolite annotation.

摘要

液相色谱-质谱联用(LC-MS)是许多代谢组学研究中常用的平台。然而,代谢物注释一直是这些研究中的主要瓶颈,部分原因是公开可用的光谱库有限,这些光谱库仅包含从一小部分已知化合物获取的串联质谱(MS/MS)数据。由于深度学习方法能够绘制分子指纹与质谱测量之间的复杂关系,越来越多的研究报告将其作为光谱匹配的替代方法。本研究的目的是研究基于MS/MS光谱的深度学习分子指纹方法,并根据已知和预测分子指纹的相似性对假定的代谢物ID进行排序。我们训练了三种类型的深度学习方法来模拟分子指纹与MS/MS光谱之间的关系。在训练之前,对从美国国家标准与技术研究院(NIST)、北美质谱库(MoNA)和人类代谢组数据库(HMDB)获得的MS/MS光谱进行了各种数据处理步骤,包括缩放、装箱和过滤。此外,还进行了最相关/箱和分子指纹的选择。在对从小分子鉴定关键评估(CASMI)2016、CASMI 2017和CASMI 2022基准数据集的化合物数据库中获得的假定代谢物ID进行排序时,对训练好的深度学习模型进行了评估。特征选择方法在模型训练之前有效地减少了冗余的分子和光谱特征。用截断特征训练的深度学习方法在对假定代谢物ID进行排序时,表现出与CSI:FingerID相当的性能。结果表明,深度学习方法在代谢物注释方面具有广阔的应用前景。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe83/11857613/8019e3a62c98/metabolites-15-00132-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe83/11857613/e0a0285629ad/metabolites-15-00132-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe83/11857613/8019e3a62c98/metabolites-15-00132-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe83/11857613/e0a0285629ad/metabolites-15-00132-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe83/11857613/8019e3a62c98/metabolites-15-00132-g002.jpg

相似文献

1
Deep Learning-Based Molecular Fingerprint Prediction for Metabolite Annotation.基于深度学习的代谢物注释分子指纹预测
Metabolites. 2025 Feb 14;15(2):132. doi: 10.3390/metabo15020132.
2
Convolutional Neural Network-Based Compound Fingerprint Prediction for Metabolite Annotation.基于卷积神经网络的代谢物注释复合指纹预测
Metabolites. 2022 Jun 29;12(7):605. doi: 10.3390/metabo12070605.
3
Deep Learning Based Metabolite Annotation.基于深度学习的代谢物注释
Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10341007.
4
MetFID: artificial neural network-based compound fingerprint prediction for metabolite annotation.MetFID:基于人工神经网络的化合物指纹预测代谢物注释。
Metabolomics. 2020 Sep 30;16(10):104. doi: 10.1007/s11306-020-01726-7.
5
compMS2Miner: An Automatable Metabolite Identification, Visualization, and Data-Sharing R Package for High-Resolution LC-MS Data Sets.compMS2Miner:一个用于高分辨 LC-MS 数据集的自动化代谢物鉴定、可视化和数据共享 R 包。
Anal Chem. 2017 Apr 4;89(7):3919-3928. doi: 10.1021/acs.analchem.6b02394. Epub 2017 Mar 27.
6
Metabolite Identification through Machine Learning- Tackling CASMI Challenge Using FingerID.通过机器学习进行代谢物鉴定——使用FingerID应对CASMI挑战
Metabolites. 2013 Jun 6;3(2):484-505. doi: 10.3390/metabo3020484.
7
IDSL_MINT: a deep learning framework to predict molecular fingerprints from mass spectra.IDSL_MINT:一种用于从质谱预测分子指纹的深度学习框架。
J Cheminform. 2024 Jan 18;16(1):8. doi: 10.1186/s13321-024-00804-5.
8
GC-EI-MS datasets of trimethylsilyl (TMS) and -butyl dimethyl silyl (TBDMS) derivatives for development of machine learning-based compound identification approaches.用于开发基于机器学习的化合物鉴定方法的三甲基硅烷基(TMS)和叔丁基二甲基硅烷基(TBDMS)衍生物的气相色谱-电子电离质谱数据集。
Data Brief. 2023 Apr 11;48:109138. doi: 10.1016/j.dib.2023.109138. eCollection 2023 Jun.
9
Deep kernel learning improves molecular fingerprint prediction from tandem mass spectra.深度学习提高串联质谱分子指纹预测。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i342-i349. doi: 10.1093/bioinformatics/btac260.
10
A Structure-Guided Molecular Network Strategy for Global Untargeted Metabolomics Data Annotation.基于结构导向的分子网络策略进行全局非靶向代谢组学数据注释。
Anal Chem. 2023 Aug 8;95(31):11603-11612. doi: 10.1021/acs.analchem.3c00849. Epub 2023 Jul 26.

本文引用的文献

1
METASPACE-ML: Context-specific metabolite annotation for imaging mass spectrometry using machine learning.METASPACE-ML:基于机器学习的成像质谱代谢物特异性注释
Nat Commun. 2024 Oct 22;15(1):9110. doi: 10.1038/s41467-024-52213-9.
2
IDSL_MINT: a deep learning framework to predict molecular fingerprints from mass spectra.IDSL_MINT:一种用于从质谱预测分子指纹的深度学习框架。
J Cheminform. 2024 Jan 18;16(1):8. doi: 10.1186/s13321-024-00804-5.
3
Deep Learning Based Metabolite Annotation.基于深度学习的代谢物注释
Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10341007.
4
MIST-CF: Chemical Formula Inference from Tandem Mass Spectra.MIST-CF:从串联质谱推断化学式。
J Chem Inf Model. 2024 Apr 8;64(7):2421-2431. doi: 10.1021/acs.jcim.3c01082. Epub 2023 Sep 19.
5
BUDDY: molecular formula discovery via bottom-up MS/MS interrogation.通过自下而上的 MS/MS 询问发现分子公式。
Nat Methods. 2023 Jun;20(6):881-890. doi: 10.1038/s41592-023-01850-x. Epub 2023 Apr 13.
6
Convolutional Neural Network-Based Compound Fingerprint Prediction for Metabolite Annotation.基于卷积神经网络的代谢物注释复合指纹预测
Metabolites. 2022 Jun 29;12(7):605. doi: 10.3390/metabo12070605.
7
Deep kernel learning improves molecular fingerprint prediction from tandem mass spectra.深度学习提高串联质谱分子指纹预测。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i342-i349. doi: 10.1093/bioinformatics/btac260.
8
MSNovelist: de novo structure generation from mass spectra.MSNovelist:从头开始从质谱生成结构。
Nat Methods. 2022 Jul;19(7):865-870. doi: 10.1038/s41592-022-01486-3. Epub 2022 May 30.
9
HMDB 5.0: the Human Metabolome Database for 2022.HMDB 5.0:2022 年人类代谢组数据库。
Nucleic Acids Res. 2022 Jan 7;50(D1):D622-D631. doi: 10.1093/nar/gkab1062.
10
High-confidence structural annotation of metabolites absent from spectral libraries.高可信度的代谢物结构注释,这些代谢物在光谱库中不存在。
Nat Biotechnol. 2022 Mar;40(3):411-421. doi: 10.1038/s41587-021-01045-9. Epub 2021 Oct 14.