IRCCS Centro Neurolesi "Bonino-Pulejo", Via Provinciale Palermo, Contrada Casazza, 98124 Messina, Italy.
Int J Mol Sci. 2022 May 7;23(9):5237. doi: 10.3390/ijms23095237.
Alzheimer's disease (AD) is an incurable neurodegenerative disease diagnosed by clinicians through healthcare records and neuroimaging techniques. These methods lack sensitivity and specificity, so new antemortem non-invasive strategies to diagnose AD are needed. Herein, we designed a machine learning predictor based on transcriptomic data obtained from the blood of AD patients and individuals without dementia (non-AD) through an 8 × 60 K microarray. The dataset was used to train different models with different hyperparameters. The support vector machines method allowed us to reach a Receiver Operating Characteristic score of 93% and an accuracy of 89%. High score levels were also achieved by the neural network and logistic regression methods. Furthermore, the Gene Ontology enrichment analysis of the features selected to train the model along with the genes differentially expressed between the non-AD and AD transcriptomic profiles shows the "mitochondrial translation" biological process to be the most interesting. In addition, inspection of the KEGG pathways suggests that the accumulation of β-amyloid triggers electron transport chain impairment, enhancement of reactive oxygen species and endoplasmic reticulum stress. Taken together, all these elements suggest that the oxidative stress induced by β-amyloid is a key feature trained by the model for the prediction of AD with high accuracy.
阿尔茨海默病(AD)是一种无法治愈的神经退行性疾病,临床医生通过医疗记录和神经影像学技术进行诊断。这些方法缺乏敏感性和特异性,因此需要新的、非侵入性的 AD 生前诊断策略。在此,我们设计了一种基于从 AD 患者和无痴呆(非 AD)个体的血液中获得的转录组数据的机器学习预测器,这些数据是通过 8×60 K 微阵列获得的。该数据集用于训练不同的模型,每个模型的超参数不同。支持向量机方法使我们能够达到 93%的接收器操作特征评分和 89%的准确性。神经网络和逻辑回归方法也取得了较高的分数水平。此外,对用于训练模型的特征以及非 AD 和 AD 转录组图谱之间差异表达的基因进行的基因本体论富集分析表明,“线粒体翻译”生物过程最有趣。此外,对 KEGG 途径的检查表明,β-淀粉样蛋白的积累会触发电子传递链损伤、活性氧和内质网应激的增强。综上所述,所有这些元素都表明,β-淀粉样蛋白诱导的氧化应激是该模型对 AD 进行高精度预测所训练的关键特征。