• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于串联质谱结构解析的神经光谱预测

Neural Spectral Prediction for Structure Elucidation with Tandem Mass Spectrometry.

作者信息

Wang Runzhong, Manjrekar Mrunali, Mahjour Babak, Avila-Pacheco Julian, Provenzano Joules, Reynolds Erin, Lederbauer Magdalena, Mashin Eivgeni, Goldman Samuel, Wang Mingxun, Weng Jing-Ke, Plata Desirée L, Clish Clary B, Coley Connor W

机构信息

Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, 02139, MA, United States.

Broad Institute of MIT and Harvard, 415 Main St, Cambridge, 02142, MA, United States.

出版信息

bioRxiv. 2025 Jun 1:2025.05.28.656653. doi: 10.1101/2025.05.28.656653.

DOI:10.1101/2025.05.28.656653
PMID:40501725
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12154671/
Abstract

Structural elucidation using untargeted tandem mass spectrometry (MS/MS) has played a critical role in advancing scientific discovery [1, 2]. However, differentiating molecular fragmentation patterns between isobaric structures remains a prominent challenge in metabolomics [3-10], drug discovery [11-13], and reaction screening [14-17], presenting a significant barrier to the cost-effective and rapid identification of unknown molecular structures. Here, we present a geometric deep learning model, ICEBERG, that simulates collision-induced dissociation in mass spectrometry to generate chemically plausible fragments and their relative intensities with awareness of collision energies and polarities. We utilize ICEBERG predictions to facilitate structure elucidation by ranking a set of candidate structures based on the similarity between their predicted MS/MS spectra and an experimental MS/MS spectrum of interest. This integrated elucidation pipeline enables state-of-the-art performance in compound annotation, with 40% top-1 accuracy on the NIST'20 [M+H] adduct subset and with 92% of correct structures appearing in the top ten predictions in the same dataset. We demonstrate several real-world case studies, including identifying clinical biomarkers of depression and tuberculous meningitis, annotating an aqueous abiotic degradation product of the pesticide thiophanate methyl, disambiguating isobaric products in pooled reaction screening, and annotating biosynthetic pathways in . Overall, this deep learning-based, chemically-interpretable paradigm for structural elucidation enables rapid molecular annotation from complex mixtures, driving discoveries across diverse scientific domains.

摘要

使用非靶向串联质谱(MS/MS)进行结构解析在推动科学发现方面发挥了关键作用[1,2]。然而,在代谢组学[3-10]、药物发现[11-13]和反应筛选[14-17]中,区分等压结构之间的分子碎片化模式仍然是一个突出的挑战,这对经济高效且快速地鉴定未知分子结构构成了重大障碍。在此,我们提出了一种几何深度学习模型ICEBERG,它模拟质谱中的碰撞诱导解离,以生成具有化学合理性的碎片及其相对强度,并考虑碰撞能量和极性。我们利用ICEBERG的预测结果,通过根据一组候选结构的预测MS/MS谱与感兴趣的实验MS/MS谱之间的相似性对它们进行排序,来促进结构解析。这种集成的解析流程在化合物注释方面实现了先进的性能,在NIST'20 [M+H]加合物子集上的 top-1准确率为40%,并且在同一数据集中92%的正确结构出现在前十的预测结果中。我们展示了几个实际案例研究,包括鉴定抑郁症和结核性脑膜炎的临床生物标志物、注释农药甲基托布津的水相非生物降解产物、在混合反应筛选中区分等压产物以及注释生物合成途径。总体而言,这种基于深度学习的、具有化学可解释性的结构解析范式能够从复杂混合物中快速进行分子注释,推动跨多个科学领域的发现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/a1f40504c38c/nihpp-2025.05.28.656653v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/3cef27078049/nihpp-2025.05.28.656653v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/d6cbca260cae/nihpp-2025.05.28.656653v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/d9650ba2d70b/nihpp-2025.05.28.656653v1-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/d5f7c93bbdd4/nihpp-2025.05.28.656653v1-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/3d15959467f6/nihpp-2025.05.28.656653v1-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/d1742ff46724/nihpp-2025.05.28.656653v1-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/45776ce30d15/nihpp-2025.05.28.656653v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/e14f7849ae03/nihpp-2025.05.28.656653v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/223c33a9c9fb/nihpp-2025.05.28.656653v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/f1634a260da6/nihpp-2025.05.28.656653v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/a1f40504c38c/nihpp-2025.05.28.656653v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/3cef27078049/nihpp-2025.05.28.656653v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/d6cbca260cae/nihpp-2025.05.28.656653v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/d9650ba2d70b/nihpp-2025.05.28.656653v1-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/d5f7c93bbdd4/nihpp-2025.05.28.656653v1-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/3d15959467f6/nihpp-2025.05.28.656653v1-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/d1742ff46724/nihpp-2025.05.28.656653v1-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/45776ce30d15/nihpp-2025.05.28.656653v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/e14f7849ae03/nihpp-2025.05.28.656653v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/223c33a9c9fb/nihpp-2025.05.28.656653v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/f1634a260da6/nihpp-2025.05.28.656653v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c960/12154671/a1f40504c38c/nihpp-2025.05.28.656653v1-f0005.jpg

相似文献

1
Neural Spectral Prediction for Structure Elucidation with Tandem Mass Spectrometry.用于串联质谱结构解析的神经光谱预测
bioRxiv. 2025 Jun 1:2025.05.28.656653. doi: 10.1101/2025.05.28.656653.
2
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
3
Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.染色体臂 1p 和 19q 缺失的检测在胶质瘤患者中的诊断准确性和成本效益。
Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.
4
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
5
Effective data visualization strategies in untargeted metabolomics.非靶向代谢组学中的有效数据可视化策略
Nat Prod Rep. 2024 Dec 2. doi: 10.1039/d4np00039k.
6
Prediction of Anti-rheumatoid Arthritis Natural Products of Xanthocerais Lignum Based on LC-MS and Artificial Intelligence.基于液相色谱-质谱联用技术和人工智能的抗类风湿性关节炎天然产物黄羊角木预测
Comb Chem High Throughput Screen. 2025;28(4):627-646. doi: 10.2174/0113862073282138240116112348.
7
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
8
Immunogenicity and seroefficacy of pneumococcal conjugate vaccines: a systematic review and network meta-analysis.肺炎球菌结合疫苗的免疫原性和血清效力:系统评价和网络荟萃分析。
Health Technol Assess. 2024 Jul;28(34):1-109. doi: 10.3310/YWHA3079.
9
Intravenous magnesium sulphate and sotalol for prevention of atrial fibrillation after coronary artery bypass surgery: a systematic review and economic evaluation.静脉注射硫酸镁和索他洛尔预防冠状动脉搭桥术后房颤:系统评价与经济学评估
Health Technol Assess. 2008 Jun;12(28):iii-iv, ix-95. doi: 10.3310/hta12280.
10
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of topotecan for ovarian cancer.拓扑替康治疗卵巢癌的临床有效性和成本效益的快速系统评价。
Health Technol Assess. 2001;5(28):1-110. doi: 10.3310/hta5280.

本文引用的文献

1
Designing for degradation: the importance of considering biotic and abiotic polymer degradation.面向降解的设计:考虑生物和非生物聚合物降解的重要性。
Environ Sci Process Impacts. 2025 May 21;27(5):1303-1316. doi: 10.1039/d5em00079c.
2
Denoising Search doubles the number of metabolite and exposome annotations in human plasma using an Orbitrap Astral mass spectrometer.去噪搜索使用轨道阱星体质谱仪使人类血浆中代谢物和暴露组注释的数量增加了一倍。
Nat Methods. 2025 May;22(5):1008-1016. doi: 10.1038/s41592-025-02646-x. Epub 2025 Mar 28.
3
Discovering organic reactions with a machine-learning-powered deciphering of tera-scale mass spectrometry data.
通过机器学习驱动的太赫兹级质谱数据解析发现有机反应。
Nat Commun. 2025 Mar 16;16(1):2587. doi: 10.1038/s41467-025-56905-8.
4
Prohormone cleavage prediction uncovers a non-incretin anti-obesity peptide.激素原裂解预测揭示了一种非肠促胰岛素抗肥胖肽。
Nature. 2025 May;641(8061):192-201. doi: 10.1038/s41586-025-08683-y. Epub 2025 Mar 5.
5
Discovery of metabolites prevails amid in-source fragmentation.代谢物的发现主要发生在源内裂解过程中。
Nat Metab. 2025 Mar;7(3):435-437. doi: 10.1038/s42255-025-01239-4.
6
Persistent PTSD symptoms are associated with plasma metabolic alterations relevant to long-term health: A metabolome-wide investigation in women.持续性创伤后应激障碍症状与长期健康相关的血浆代谢改变有关:一项针对女性的全代谢组研究。
Psychol Med. 2025 Feb 10;55:e30. doi: 10.1017/S0033291724003374.
7
Continuous collective analysis of chemical reactions.化学反应的连续集体分析
Nature. 2024 Dec;636(8042):374-379. doi: 10.1038/s41586-024-08211-4. Epub 2024 Dec 11.
8
PubChem 2025 update.PubChem 2025更新版。
Nucleic Acids Res. 2025 Jan 6;53(D1):D1516-D1525. doi: 10.1093/nar/gkae1059.
9
Statistical analysis of feature-based molecular networking results from non-targeted metabolomics data.基于特征的非靶向代谢组学数据分子网络结果的统计分析
Nat Protoc. 2025 Jan;20(1):92-162. doi: 10.1038/s41596-024-01046-3. Epub 2024 Sep 20.
10
Commensal consortia decolonize Enterobacteriaceae via ecological control.共生联合体通过生态控制使肠杆菌科去殖民化。
Nature. 2024 Sep;633(8031):878-886. doi: 10.1038/s41586-024-07960-6. Epub 2024 Sep 18.