Advancing the Prediction of MS/MS Spectra Using Machine Learning.

Author information

Nguyen Julia, Overstreet Richard, King Ethan, Ciesielski Danielle

Affiliations

Computing and Analytics Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States.

Signature Science and Technology Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States.

Publication information

J Am Soc Mass Spectrom. 2024 Oct 2;35(10):2256-2266. doi: 10.1021/jasms.4c00154. Epub 2024 Sep 11.

DOI:10.1021/jasms.4c00154

PMID:39258761

Abstract

Tandem mass spectrometry (MS/MS) is an important tool for the identification of small molecules and metabolites where resultant spectra are most commonly identified by matching them with spectra in MS/MS reference libraries. While popular, this strategy is limited by the contents of existing reference libraries. In response to this limitation, various methods are being developed for the generation of spectra to augment existing libraries. Recently, machine learning and deep learning techniques have been applied to predict spectra with greater speed and accuracy. Here, we investigate the challenges these algorithms face in achieving fast and accurate predictions on a wide range of small molecules. The challenges are often amplified by the use of generic machine learning benchmarking tactics, which lead to misleading accuracy scores. Curating data sets, only predicting spectra for sufficiently high collision energies, and working more closely with experimental mass spectrometrists are recommended strategies to improve overall prediction accuracy in this nuanced field.
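The library-matching step described in the abstract can be illustrated with a minimal sketch. It assumes cosine similarity on binned peaks as the match score, which is a common choice but is not stated in the abstract; the function names, the 1.0 m/z bin width, and the toy spectra are all hypothetical.

```python
import math
from collections import defaultdict


def bin_spectrum(peaks, bin_width=1.0):
    """Sum peak intensities into fixed-width m/z bins."""
    binned = defaultdict(float)
    for mz, intensity in peaks:
        binned[int(mz // bin_width)] += intensity
    return binned


def cosine_similarity(peaks_a, peaks_b, bin_width=1.0):
    """Cosine similarity between two spectra given as (m/z, intensity) pairs."""
    a = bin_spectrum(peaks_a, bin_width)
    b = bin_spectrum(peaks_b, bin_width)
    # Only bins present in both spectra contribute to the dot product.
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def best_library_match(query, library, bin_width=1.0):
    """Return (score, name) of the highest-scoring library entry, or None."""
    scored = [
        (cosine_similarity(query, peaks, bin_width), name)
        for name, peaks in library.items()
    ]
    return max(scored) if scored else None


# Hypothetical toy data: made-up peaks, not taken from any real library.
query = [(91.05, 100.0), (119.08, 40.0)]
library = {
    "compound_A": [(91.06, 90.0), (119.09, 35.0)],
    "compound_B": [(77.04, 100.0), (105.03, 60.0)],
}
score, name = best_library_match(query, library)
print(name, round(score, 3))
```

A query whose true compound is missing from `library` can only ever return the nearest wrong entry, which is the coverage limitation that motivates predicting spectra to augment the library.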

Similar articles

Advancing the Prediction of MS/MS Spectra Using Machine Learning.

J Am Soc Mass Spectrom. 2024 Oct 2;35(10):2256-2266. doi: 10.1021/jasms.4c00154. Epub 2024 Sep 11.

Augmentation of MS/MS Libraries with Spectral Interpolation for Improved Identification.

J Chem Inf Model. 2022 Aug 22;62(16):3724-3733. doi: 10.1021/acs.jcim.2c00620. Epub 2022 Jul 29.

How Well Can We Predict Mass Spectra from Structures? Benchmarking Competitive Fragmentation Modeling for Metabolite Identification on Untrained Tandem Mass Spectra.

J Chem Inf Model. 2022 Sep 12;62(17):4049-4056. doi: 10.1021/acs.jcim.2c00936. Epub 2022 Aug 31.

Predicting Collision-Induced-Dissociation Tandem Mass Spectra (CID-MS/MS) Using Ab Initio Molecular Dynamics.

J Chem Inf Model. 2024 Oct 14;64(19):7470-7487. doi: 10.1021/acs.jcim.4c00760. Epub 2024 Sep 27.

Metabolomic spectral libraries for data-independent SWATH liquid chromatography mass spectrometry acquisition.

Anal Bioanal Chem. 2018 Mar;410(7):1873-1884. doi: 10.1007/s00216-018-0860-x. Epub 2018 Feb 6.

Machine learning for identification of silylated derivatives from mass spectra.

J Cheminform. 2022 Sep 15;14(1):62. doi: 10.1186/s13321-022-00636-1.

Identification of small molecules using accurate mass MS/MS search.

Mass Spectrom Rev. 2018 Jul;37(4):513-532. doi: 10.1002/mas.21535. Epub 2017 Apr 24.

CFM-ID 4.0 - a web server for accurate MS-based metabolite identification.

Nucleic Acids Res. 2022 Jul 5;50(W1):W165-W174. doi: 10.1093/nar/gkac383.

Deep learning embedder method and tool for mass spectra similarity search.

J Proteomics. 2021 Feb 10;232:104070. doi: 10.1016/j.jprot.2020.104070. Epub 2020 Dec 8.

Machine Learning in Small-Molecule Mass Spectrometry.

Annu Rev Anal Chem (Palo Alto Calif). 2025 May;18(1):193-215. doi: 10.1146/annurev-anchem-071224-082157. Epub 2025 Feb 27.

Cited by

mineMS2: annotation of spectral libraries with exact fragmentation patterns.

J Cheminform. 2025 Jul 24;17(1):111. doi: 10.1186/s13321-025-01051-y.

Neural Spectral Prediction for Structure Elucidation with Tandem Mass Spectrometry.

bioRxiv. 2025 Jun 1:2025.05.28.656653. doi: 10.1101/2025.05.28.656653.

The Enzyme Effect: Broadening the Horizon of MS Optimization to Nontryptic Digestion in Proteomics.

J Am Soc Mass Spectrom. 2025 Feb 5;36(2):299-308. doi: 10.1021/jasms.4c00396. Epub 2025 Jan 13.
