Suppr超能文献

基于分子式推导脂质分类。

Deriving Lipid Classification Based on Molecular Formulas.

作者信息

Mitchell Joshua M, Flight Robert M, Moseley Hunter N B

机构信息

Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USA.

Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA.

出版信息

Metabolites. 2020 Mar 24;10(3):122. doi: 10.3390/metabo10030122.

Abstract

Despite instrument and algorithmic improvements, the untargeted and accurate assignment of metabolites remains an unsolved problem in metabolomics. New assignment methods such as our SMIRFE algorithm can assign elemental molecular formulas to observed spectral features in a highly untargeted manner without orthogonal information from tandem MS or chromatography. However, for many lipidomics applications, it is necessary to know at least the lipid category or class that is associated with a detected spectral feature to derive a biochemical interpretation. Our goal is to develop a method for robustly classifying elemental molecular formula assignments into lipid categories for an application to SMIRFE-generated assignments. Using a Random Forest machine learning approach, we developed a method that can predict lipid category and class from SMIRFE non-adducted molecular formula assignments. Our methods achieve high average predictive accuracy (>90%) and precision (>83%) across all eight of the lipid categories in the LIPIDMAPS database. Classification performance was evaluated using sets of theoretical, data-derived, and artifactual molecular formulas. Our methods enable the lipid classification of non-adducted molecular formula assignments generated by SMIRFE without orthogonal information, facilitating the biochemical interpretation of untargeted lipidomics experiments. This lipid classification appears insufficient for validating single-spectrum assignments, but could be useful in cross-spectrum assignment validation.

摘要

尽管仪器和算法有所改进,但在代谢组学中,代谢物的非靶向和准确归属仍然是一个未解决的问题。像我们的SMIRFE算法这样的新归属方法,可以以高度非靶向的方式为观察到的光谱特征分配元素分子式,而无需串联质谱或色谱的正交信息。然而,对于许多脂质组学应用而言,为了进行生化解释,有必要至少知道与检测到的光谱特征相关的脂质类别。我们的目标是开发一种方法,将元素分子式归属稳健地分类到脂质类别中,以应用于SMIRFE生成的归属。我们使用随机森林机器学习方法,开发了一种可以从SMIRFE非加合分子式归属预测脂质类别和类别的方法。我们的方法在LIPIDMAPS数据库的所有八个脂质类别中均实现了较高的平均预测准确率(>90%)和精确率(>83%)。使用理论、数据衍生和人工合成分子式集对分类性能进行了评估。我们的方法能够对SMIRFE在没有正交信息的情况下生成的非加合分子式归属进行脂质分类,有助于对非靶向脂质组学实验进行生化解释。这种脂质分类对于验证单光谱归属似乎不够充分,但在交叉光谱归属验证中可能有用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5199/7143220/bfc5d9cdc24e/metabolites-10-00122-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验