Anal Chem. 2019 Mar 5;91(5):3246-3253. doi: 10.1021/acs.analchem.8b03126. Epub 2019 Feb 11.
Computational metabolite annotation in untargeted profiling aims at uncovering neutral molecular masses of underlying metabolites and assign those with putative identities. Existing annotation strategies rely on the observation and annotation of adducts to determine metabolite neutral masses. However, a significant fraction of features usually detected in untargeted experiments remains unannotated, which limits our ability to determine neutral molecular masses. Despite the availability of tools to annotate, relatively few of them benefit from the inherent presence of in-source fragments in liquid chromatography-electrospray ionization-mass spectrometry. In this study, we introduce a strategy to annotate in-source fragments in untargeted data using low-energy tandem mass spectrometry (MS) spectra from the METLIN library. Our algorithm, MISA (METLIN-guided in-source annotation), compares detected features against low-energy fragments from MS/MS spectra, enabling robust annotation and putative identification of metabolic features based on low-energy spectral matching. The algorithm was evaluated through an annotation analysis of a total of 140 metabolites across three different sets of biological samples analyzed with liquid chromatography-mass spectrometry. Results showed that, in cases where adducts were not formed or detected, MISA was able to uncover neutral molecular masses by in-source fragment matching. MISA was also able to provide putative metabolite identities via two annotation scores. These scores take into account the number of in-source fragments matched and the relative intensity similarity between the experimental data and the reference low-energy MS/MS spectra. Overall, results showed that in-source fragmentation is a highly frequent phenomena that should be considered for comprehensive feature annotation. Thus, combined with adduct annotation, this strategy adds a complementary annotation layer, enabling in-source fragments to be annotated and increasing putative identification confidence. The algorithm is integrated into the XCMS Online platform and is freely available at http://xcmsonline.scripps.edu .
非靶向谱分析中的计算代谢物注释旨在揭示潜在代谢物的中性分子量,并对具有假定身份的代谢物进行注释。现有的注释策略依赖于加合物的观察和注释来确定代谢物的中性质量。然而,在非靶向实验中通常检测到的特征中有很大一部分仍然未被注释,这限制了我们确定中性分子量的能力。尽管有工具可以进行注释,但相对较少的工具受益于液相色谱-电喷雾电离-质谱中固有的源内碎片的存在。在这项研究中,我们介绍了一种使用 METLIN 库中的低能串联质谱 (MS) 谱图对非靶向数据中的源内碎片进行注释的策略。我们的算法 MISA(基于 METLIN 的源内注释)将检测到的特征与 MS/MS 谱图的低能碎片进行比较,从而能够基于低能谱匹配对代谢物特征进行稳健注释和假定鉴定。该算法通过对 140 种不同代谢物在三个不同生物样本组的液相色谱-质谱分析中的注释分析进行了评估。结果表明,在未形成或未检测到加合物的情况下,MISA 能够通过源内碎片匹配揭示中性分子量。MISA 还能够通过两种注释分数提供假定的代谢物身份。这些分数考虑了匹配的源内碎片数量以及实验数据与参考低能 MS/MS 谱之间的相对强度相似性。总的来说,结果表明源内碎片是一种非常常见的现象,应该考虑用于全面的特征注释。因此,与加合物注释相结合,这种策略增加了一个补充注释层,能够对源内碎片进行注释,并提高假定鉴定的置信度。该算法已集成到 XCMS Online 平台中,并可在 http://xcmsonline.scripps.edu 免费获得。