• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度学习驱动的气相色谱-质谱联用库检索及其在代谢组学中的应用

Deep Learning Driven GC-MS Library Search and Its Application for Metabolomics.

作者信息

Matyushin Dmitriy D, Sholokhova Anastasia Yu, Buryak Aleksey K

机构信息

A.N. Frumkin Institute of Physical Chemistry and Electrochemistry, Russian Academy of Sciences, 31 Leninsky Prospect, Moscow, GSP-1, 119071, Russia.

出版信息

Anal Chem. 2020 Sep 1;92(17):11818-11825. doi: 10.1021/acs.analchem.0c02082. Epub 2020 Aug 12.

DOI:10.1021/acs.analchem.0c02082
PMID:32867500
Abstract

Preliminary compound identification and peak annotation in gas chromatography-mass spectrometry is usually made using mass spectral databases. There are a few algorithms that enable performing a search of a spectrum in a large mass spectral library. In many cases, a library search procedure returns a wrong answer even if a correct compound is contained in a library. In this work, we present a deep learning driven approach to a library search in order to reduce the probability of such cases. Machine learning ranking (learning to rank) is a class of machine learning and deep learning algorithms that perform a comparison (ranking) of objects. This work introduces the usage of deep learning ranking for small molecules identification using low-resolution electron ionization mass spectrometry. Instead of simple similarity measures for two spectra, such as the dot product or the Euclidean distance between vectors that represent spectra, a deep convolutional neural network is used. The deep learning ranking model outperforms other approaches and enables reducing a fraction of wrong answers (at rank-1) by 9-23% depending on the used data set. Spectra from the Golm Metabolome Database, Human Metabolome Database, and FiehnLib were used for testing the model.

摘要

气相色谱 - 质谱联用中的初步化合物鉴定和峰注释通常使用质谱数据库进行。有一些算法能够在大型质谱库中搜索光谱。在许多情况下,即使库中包含正确的化合物,库搜索程序也会返回错误答案。在这项工作中,我们提出了一种深度学习驱动的库搜索方法,以降低此类情况发生的概率。机器学习排序(学习排序)是一类进行对象比较(排序)的机器学习和深度学习算法。这项工作介绍了使用深度学习排序通过低分辨率电子电离质谱法鉴定小分子的方法。使用深度卷积神经网络,而不是像代表光谱的向量之间的点积或欧几里得距离这样的两个光谱的简单相似性度量。深度学习排序模型优于其他方法,根据所使用的数据集,能够将错误答案的比例(在排名第一时)降低9 - 23%。来自戈尔姆代谢组数据库、人类代谢组数据库和菲恩库的光谱用于测试该模型。

相似文献

1
Deep Learning Driven GC-MS Library Search and Its Application for Metabolomics.深度学习驱动的气相色谱-质谱联用库检索及其在代谢组学中的应用
Anal Chem. 2020 Sep 1;92(17):11818-11825. doi: 10.1021/acs.analchem.0c02082. Epub 2020 Aug 12.
2
Deep learning under mass-to-charge ratio pre-retrieval to realize electron ionization mass spectrometry library retrieval.基于质荷比预检索的深度学习实现电子电离质谱文库检索。
Rapid Commun Mass Spectrom. 2022 Dec 30;36(24):e9398. doi: 10.1002/rcm.9398.
3
Deep learning embedder method and tool for mass spectra similarity search.用于质谱相似性搜索的深度学习嵌入器方法和工具。
J Proteomics. 2021 Feb 10;232:104070. doi: 10.1016/j.jprot.2020.104070. Epub 2020 Dec 8.
4
How searching against multiple libraries can lead to biased results in GC/MS-based metabolomics.在基于气相色谱-质谱联用(GC/MS)的代谢组学中,对多个文库进行检索如何导致有偏差的结果。
Rapid Commun Mass Spectrom. 2023 Feb 15;37(3):e9437. doi: 10.1002/rcm.9437.
5
Steroid identification via deep learning retention time predictions and two-dimensional gas chromatography-high resolution mass spectrometry.基于深度学习保留时间预测的二维气相色谱-高分辨质谱法进行类固醇鉴定。
J Chromatogr A. 2020 Feb 8;1612:460661. doi: 10.1016/j.chroma.2019.460661. Epub 2019 Oct 28.
6
MS2CNN: predicting MS/MS spectrum based on protein sequence using deep convolutional neural networks.MS2CNN:基于深度卷积神经网络的蛋白质序列预测 MS/MS 谱。
BMC Genomics. 2019 Dec 24;20(Suppl 9):906. doi: 10.1186/s12864-019-6297-6.
7
Evaluation of gas chromatography-atmospheric pressure chemical ionization-mass spectrometry as an alternative to gas chromatography-electron ionization-mass spectrometry: avocado fruit as example.评价气相色谱-大气压化学电离质谱作为气相色谱-电子电离质谱的替代方法:以鳄梨果实为例。
J Chromatogr A. 2013 Oct 25;1313:228-44. doi: 10.1016/j.chroma.2013.08.084. Epub 2013 Aug 30.
8
Applying in-silico retention index and mass spectra matching for identification of unknown metabolites in accurate mass GC-TOF mass spectrometry.应用在线保留指数和质谱匹配鉴定精确质量 GC-TOF 质谱中的未知代谢物。
Anal Chem. 2011 Aug 1;83(15):5895-902. doi: 10.1021/ac2006137. Epub 2011 Jun 28.
9
FiehnLib: mass spectral and retention index libraries for metabolomics based on quadrupole and time-of-flight gas chromatography/mass spectrometry.FiehnLib:基于四极杆和飞行时间气相色谱/质谱的代谢组学质谱和保留指数库。
Anal Chem. 2009 Dec 15;81(24):10038-48. doi: 10.1021/ac9019522.
10
GCMS-ID: a webserver for identifying compounds from gas chromatography mass spectrometry experiments.GCMS-ID:一个用于从气相色谱质谱实验中鉴定化合物的网络服务器。
Nucleic Acids Res. 2024 Jul 5;52(W1):W381-W389. doi: 10.1093/nar/gkae425.

引用本文的文献

1
Fully Automated Deep Learning Enabled Miniature Mass Spectrometry System for Psychoactive Therapeutic Drug Monitoring.用于精神活性治疗药物监测的全自动化深度学习微型质谱系统
Adv Sci (Weinh). 2025 Aug;12(32):e02721. doi: 10.1002/advs.202502721. Epub 2025 May 23.
2
Advances in Food Aroma Analysis: Extraction, Separation, and Quantification Techniques.食品香气分析进展:提取、分离和定量技术
Foods. 2025 Apr 9;14(8):1302. doi: 10.3390/foods14081302.
3
Characterizing Families of Spectral Similarity Scores and Their Use Cases for Gas Chromatography-Mass Spectrometry Small Molecule Identification.
表征光谱相似性得分家族及其在气相色谱-质谱小分子鉴定中的应用案例
Metabolites. 2023 Oct 21;13(10):1101. doi: 10.3390/metabo13101101.
4
Highly accurate and large-scale collision cross sections prediction with graph neural networks.利用图神经网络进行高精度大规模碰撞截面预测。
Commun Chem. 2023 Jul 4;6(1):139. doi: 10.1038/s42004-023-00939-w.
5
Ultra-fast and accurate electron ionization mass spectrum matching for compound identification with million-scale in-silico library.超快速、准确的电子电离质谱匹配,用于具有百万级规模的化合物鉴定。
Nat Commun. 2023 Jun 22;14(1):3722. doi: 10.1038/s41467-023-39279-7.
6
Intelligent Workflow and Software for Non-Target Analysis of Complex Samples Using a Mixture of Toxic Transformation Products of Unsymmetrical Dimethylhydrazine as an Example.使用偏二甲肼毒性转化产物混合物作为实例的复杂样品非靶向分析智能工作流程和软件。
Molecules. 2023 Apr 12;28(8):3409. doi: 10.3390/molecules28083409.
7
Rapid Approximate Subset-Based Spectra Prediction for Electron Ionization-Mass Spectrometry.基于快速近似子集的电子电离质谱谱预测。
Anal Chem. 2023 Feb 7;95(5):2653-2663. doi: 10.1021/acs.analchem.2c02093. Epub 2023 Jan 25.
8
New software tools, databases, and resources in metabolomics: updates from 2020.代谢组学新软件工具、数据库和资源:2020 年的更新。
Metabolomics. 2021 May 11;17(5):49. doi: 10.1007/s11306-021-01796-1.