• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的高分辨率质谱数据全自动非约束分析。

Fully Automated Unconstrained Analysis of High-Resolution Mass Spectrometry Data with Machine Learning.

机构信息

Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky Prospekt 47, Moscow 119991, Russia.

出版信息

J Am Chem Soc. 2022 Aug 17;144(32):14590-14606. doi: 10.1021/jacs.2c03631. Epub 2022 Aug 8.

DOI:10.1021/jacs.2c03631
PMID:35939718
Abstract

Mass spectrometry (MS) is a convenient, highly sensitive, and reliable method for the analysis of complex mixtures, which is vital for materials science, life sciences fields such as metabolomics and proteomics, and mechanistic research in chemistry. Although it is one of the most powerful methods for individual compound detection, complete signal assignment in complex mixtures is still a great challenge. The unconstrained formula-generating algorithm, covering the entire spectra and revealing components, is a "dream tool" for researchers. We present the framework for efficient MS data interpretation, describing a novel approach for detailed analysis based on deisotoping performed by gradient-boosted decision trees and a neural network that generates molecular formulas from the fine isotopic structure, approaching the long-standing inverse spectral problem. The methods were successfully tested on three examples: fragment ion analysis in protein sequencing for proteomics, analysis of the natural samples for life sciences, and study of the cross-coupling catalytic system for chemistry.

摘要

质谱(MS)是一种用于分析复杂混合物的便捷、高灵敏度和可靠的方法,对材料科学、代谢组学和蛋白质组学等生命科学领域以及化学中的机制研究至关重要。尽管它是用于检测单个化合物的最强大的方法之一,但在复杂混合物中进行完整的信号分配仍然是一个巨大的挑战。无约束公式生成算法涵盖整个光谱并揭示成分,是研究人员的“梦想工具”。我们提出了一种有效的 MS 数据解释框架,描述了一种基于梯度提升决策树和神经网络的详细分析新方法,该方法通过对精细同位素结构进行去同位素处理来生成分子公式,从而解决长期存在的逆光谱问题。该方法在三个示例中得到了成功的测试:蛋白质组学中的蛋白质测序中碎片离子分析、生命科学中的天然样品分析以及化学中的交叉偶联催化体系研究。

相似文献

1
Fully Automated Unconstrained Analysis of High-Resolution Mass Spectrometry Data with Machine Learning.基于机器学习的高分辨率质谱数据全自动非约束分析。
J Am Chem Soc. 2022 Aug 17;144(32):14590-14606. doi: 10.1021/jacs.2c03631. Epub 2022 Aug 8.
2
Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学:基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍
3
mineXpert: Biological Mass Spectrometry Data Visualization and Mining with Full JavaScript Ability.mineXpert:具有完整 JavaScript 能力的生物质谱数据可视化和挖掘。
J Proteome Res. 2019 May 3;18(5):2254-2259. doi: 10.1021/acs.jproteome.9b00099. Epub 2019 Apr 17.
4
Metabolite identification using automated comparison of high-resolution multistage mass spectral trees.使用高分辨多级质谱树的自动比较进行代谢产物鉴定。
Anal Chem. 2012 Jul 3;84(13):5524-34. doi: 10.1021/ac2034216. Epub 2012 Jun 22.
5
[Analysis of metabolomics and proteomics based on capillary electrophoresis-mass spectrometry].基于毛细管电泳-质谱联用的代谢组学和蛋白质组学分析
Se Pu. 2020 Sep 8;38(9):1013-1021. doi: 10.3724/SP.J.1123.2020.02025.
6
MetaClean: a machine learning-based classifier for reduced false positive peak detection in untargeted LC-MS metabolomics data.MetaClean:一种基于机器学习的分类器,用于降低非靶向 LC-MS 代谢组学数据中假阳性峰的检测率。
Metabolomics. 2020 Oct 21;16(11):117. doi: 10.1007/s11306-020-01738-3.
7
MultiAlign: a multiple LC-MS analysis tool for targeted omics analysis.MultiAlign:一种用于靶向组学分析的多重 LC-MS 分析工具。
BMC Bioinformatics. 2013 Feb 12;14:49. doi: 10.1186/1471-2105-14-49.
8
Correcting for the effects of natural abundance in stable isotope resolved metabolomics experiments involving ultra-high resolution mass spectrometry.在涉及超高分辨率质谱的稳定同位素解析代谢组学实验中,校正自然丰度的影响。
BMC Bioinformatics. 2010 Mar 17;11:139. doi: 10.1186/1471-2105-11-139.
9
Retention time alignment of LC/MS data by a divide-and-conquer algorithm.采用分而治之算法的 LC/MS 数据保留时间对齐。
J Am Soc Mass Spectrom. 2012 Apr;23(4):764-72. doi: 10.1007/s13361-011-0334-2.
10
Intensity-based protein identification by machine learning from a library of tandem mass spectra.基于强度的蛋白质鉴定:通过机器学习从串联质谱库中进行
Nat Biotechnol. 2004 Feb;22(2):214-9. doi: 10.1038/nbt930. Epub 2004 Jan 18.

引用本文的文献

1
Deep-learning-enabled online mass spectrometry of the reaction product of a single catalyst nanoparticle.基于深度学习的单个催化剂纳米颗粒反应产物的在线质谱分析。
Nat Commun. 2025 Aug 5;16(1):7203. doi: 10.1038/s41467-025-62602-3.
2
A Fast Neural Network for Isotopic Charge State Assignment.一种用于同位素电荷态分配的快速神经网络。
J Am Chem Soc. 2025 Jun 25;147(25):21610-21620. doi: 10.1021/jacs.5c03162. Epub 2025 Jun 10.
3
Discovering organic reactions with a machine-learning-powered deciphering of tera-scale mass spectrometry data.
通过机器学习驱动的太赫兹级质谱数据解析发现有机反应。
Nat Commun. 2025 Mar 16;16(1):2587. doi: 10.1038/s41467-025-56905-8.
4
Evaluation of the current methods for assigning molecular formulas to dissolved organic matter using high resolution mass spectrometry.使用高分辨率质谱法评估当前为溶解有机物分配分子式的方法。
Sci Rep. 2025 Mar 8;15(1):8105. doi: 10.1038/s41598-025-87539-x.
5
Mass Spectrometry Advancements and Applications for Biomarker Discovery, Diagnostic Innovations, and Personalized Medicine.质谱技术在生物标志物发现、诊断创新和个性化医学中的应用和进展。
Int J Mol Sci. 2024 Sep 12;25(18):9880. doi: 10.3390/ijms25189880.
6
Recent Developments in Machine Learning for Mass Spectrometry.用于质谱分析的机器学习的最新进展
ACS Meas Sci Au. 2024 Feb 21;4(3):233-246. doi: 10.1021/acsmeasuresciau.3c00060. eCollection 2024 Jun 19.
7
Emerging contaminants: A One Health perspective.新兴污染物:“同一健康”视角
Innovation (Camb). 2024 Mar 13;5(4):100612. doi: 10.1016/j.xinn.2024.100612. eCollection 2024 Jul 1.
8
Imaging plant metabolism in situ.在体成像植物代谢。
J Exp Bot. 2024 Mar 14;75(6):1654-1670. doi: 10.1093/jxb/erad423.
9
Characterizing lignins from various sources and treatment processes after optimized sample preparation techniques and analysis via ESI-HRMS and custom mass defect software tools.在优化样品制备技术后,通过电喷雾电离高分辨率质谱(ESI-HRMS)和定制质量缺陷软件工具对来自各种来源和处理过程的木质素进行表征分析。
Anal Bioanal Chem. 2023 Nov;415(27):6663-6675. doi: 10.1007/s00216-023-04942-x. Epub 2023 Sep 16.
10
Ultra-fast and accurate electron ionization mass spectrum matching for compound identification with million-scale in-silico library.超快速、准确的电子电离质谱匹配,用于具有百万级规模的化合物鉴定。
Nat Commun. 2023 Jun 22;14(1):3722. doi: 10.1038/s41467-023-39279-7.