Suppr超能文献

绘制质谱预测的边界:对机器学习用于异源氨基酸的电子电离质谱预测的评估

Mapping the Edges of Mass Spectral Prediction: Evaluation of Machine Learning EIMS Prediction for Xeno Amino Acids.

作者信息

Brown Sean M, Allgair Evan, Kryštůfek Robin

机构信息

Department of Biological Sciences, University of Maryland, Baltimore County, Baltimore, Maryland 21250, United States.

Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, 160 00 Praha 6-Dejvice, Czechia.

出版信息

Anal Chem. 2025 May 20;97(19):10282-10288. doi: 10.1021/acs.analchem.5c00286. Epub 2025 May 7.

Abstract

Mass spectrometry is one of the most effective analytical methods for unknown compound identification. By comparing observed / spectra with a database of experimentally determined spectra, this process identifies compound(s) in any given sample. Unknown sample identification is thus limited to whatever has been experimentally determined. To address the reliance on experimentally determined signatures, multiple state-of-the-art MS spectra prediction algorithms have been developed within the past half decade. Here we evaluate the accuracy of the NEIMS spectral prediction algorithm. We focus our analyses on monosubstituted α-amino acids given their significance as important targets for astrobiology, synthetic biology, and diverse biomedical applications. Our general intent is to inform those using generated spectra for detection of unknown biomolecules. We find predicted spectra are inaccurate for amino acids beyond the algorithms training data. Interestingly, these inaccuracies are not explained by physicochemical differences or the derivatization state of the amino acids measured. We thus highlight the need to improve both current machine learning based approaches and further optimization of spectral prediction algorithms so as to expand databases for structures beyond what is currently experimentally possible, even including theoretical molecules.

摘要

质谱分析是鉴定未知化合物最有效的分析方法之一。通过将观察到的光谱与实验测定光谱的数据库进行比较,该过程可识别任何给定样品中的化合物。因此,未知样品的鉴定仅限于已通过实验确定的物质。为了解决对实验确定特征的依赖,在过去五年中开发了多种最先进的质谱光谱预测算法。在此,我们评估了NEIMS光谱预测算法的准确性。鉴于单取代α-氨基酸作为天体生物学、合成生物学和各种生物医学应用的重要目标的重要性,我们将分析重点放在了它们身上。我们的总体目的是为那些使用生成的光谱检测未知生物分子的人提供信息。我们发现,对于超出算法训练数据范围的氨基酸,预测光谱不准确。有趣的是,这些不准确之处无法通过所测氨基酸的物理化学差异或衍生化状态来解释。因此,我们强调需要改进当前基于机器学习的方法,并进一步优化光谱预测算法,以便扩展目前实验上无法实现的结构数据库,甚至包括理论分子。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9ce/12096351/0e144492993c/ac5c00286_0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验