Suppr超能文献

OPLS 判别分析在癌症组织脂质组学数据中的特征选择。

Feature selection for OPLS discriminant analysis of cancer tissue lipidomics data.

机构信息

Department of molecular and chemical physics, Moscow Institute of Physics and Technology, Moscow, Russia.

V.L. Talrose Institute for Energy Problems of Chemical Physics, N.N. Semenov Federal Center of Chemical Physic, Russian Academy of Sciences, Moscow, Russia.

出版信息

J Mass Spectrom. 2020 Jan;55(1):e4457. doi: 10.1002/jms.4457. Epub 2019 Dec 9.

Abstract

The mass spectrometry-based molecular profiling can be used for better differentiation between normal and cancer tissues and for the detection of neoplastic transformation, which is of great importance for diagnostics of a pathology, prognosis of its evolution trend, and development of a treatment strategy. The aim of the present study is the evaluation of tissue classification approaches based on various data sets derived from the molecular profile of the organic solvent extracts of a tissue. A set of possibilities are considered for the orthogonal projections to latent structures discriminant analysis: all mass spectrometric peaks over 300 counts threshold, subset of peaks selected by ranking with support vector machine algorithm, peaks selected by random forest algorithm, peaks with the statistically significant difference of the intensity determined by the Mann-Whitney U test, peaks identified as lipids, and both identified and significantly different peaks. The best predictive potential is obtained for OPLS-DA model built on nonpolar glycerolipids (Q = 0.64, area under curve [AUC] = 0.95); the second one is OPLS-DA model with lipid peaks selected by random forest algorithm (Q = 0.58, AUC = 0.87). Moreover, models based on particular molecular classes are more preferable from biological point of view, resulting in new explanatory mechanisms of pathophysiology and providing a pathway analysis. Another promising features for OPLS-DA modeling are phosphatidylethanolamines (Q = 0.48, AUC = 0.86).

摘要

基于质谱的分子特征分析可用于更好地区分正常组织和癌症组织,以及检测肿瘤转化,这对病理诊断、预测其演化趋势和制定治疗策略具有重要意义。本研究旨在评估基于组织有机溶剂提取物分子特征的各种数据集的组织分类方法。正交投影到潜在结构判别分析(OPLS-DA)考虑了一系列可能性:所有计数超过 300 的质谱峰、通过支持向量机算法排序选择的峰子集、通过随机森林算法选择的峰、通过曼-惠特尼 U 检验确定强度有统计学差异的峰、鉴定为脂质的峰以及鉴定和差异显著的峰。建立在非极性甘油脂上的 OPLS-DA 模型具有最佳的预测潜力(Q = 0.64,曲线下面积 [AUC] = 0.95);其次是通过随机森林算法选择脂质峰的 OPLS-DA 模型(Q = 0.58,AUC = 0.87)。此外,从生物学角度来看,基于特定分子类别的模型更可取,从而为病理生理学提供新的解释机制,并进行途径分析。OPLS-DA 建模的另一个有前途的特征是磷脂酰乙醇胺(Q = 0.48,AUC = 0.86)。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验