Laboratory of Integrative Genomics, Department of Integrative Biology, School of BioSciences and Technology, Vellore Institute of Technology (VIT), Vellore, 632014, Tamil Nadu, India.
Department of Genome Mapping, Molecular Genetics and Genome Mapping Laboratory, Agricultural Genetic Engineering Research Institute, Giza, Egypt.
Metab Brain Dis. 2023 Apr;38(4):1297-1310. doi: 10.1007/s11011-023-01171-0. Epub 2023 Feb 21.
The progressive, chronic nature of Alzheimer's disease (AD), a form of dementia, defaces the adulthood of elderly individuals. The pathogenesis of the condition is primarily unascertained, turning the treatment efficacy more arduous. Therefore, understanding the genetic etiology of AD is essential to identifying targeted therapeutics. This study aimed to use machine-learning techniques of expressed genes in patients with AD to identify potential biomarkers that can be used for future therapy. The dataset is accessed from the Gene Expression Omnibus (GEO) database (Accession Number: GSE36980). The subgroups (AD blood samples from frontal, hippocampal, and temporal regions) are individually investigated against non-AD models. Prioritized gene cluster analyses are conducted with the STRING database. The candidate gene biomarkers were trained with various supervised machine-learning (ML) classification algorithms. The interpretation of the model prediction is perpetrated with explainable artificial intelligence (AI) techniques. This experiment revealed 34, 60, and 28 genes as target biomarkers of AD mapped from the frontal, hippocampal, and temporal regions. It is identified ORAI2 as a shared biomarker in all three areas strongly associated with AD's progression. The pathway analysis showed that STIM1 and TRPC3 are strongly associated with ORAI2. We found three hub genes, TPI1, STIM1, and TRPC3, in the network of the ORAI2 gene that might be involved in the molecular pathogenesis of AD. Naive Bayes classified the samples of different groups by fivefold cross-validation with 100% accuracy. AI and ML are promising tools in identifying disease-associated genes that will advance the field of targeted therapeutics against genetic diseases.
阿尔茨海默病(AD)是一种痴呆症,具有进行性和慢性特征,会损害老年人的成年生活。该病的发病机制主要尚未确定,这使得治疗效果更加困难。因此,了解 AD 的遗传病因对于确定靶向治疗方法至关重要。本研究旨在使用 AD 患者表达基因的机器学习技术来识别潜在的生物标志物,可用于未来的治疗。该数据集来自基因表达综合数据库(GEO)(访问号:GSE36980)。针对非 AD 模型分别研究了亚组(来自额叶、海马和颞叶的 AD 血液样本)。使用 STRING 数据库进行优先基因簇分析。使用各种监督机器学习(ML)分类算法对候选基因生物标志物进行训练。使用可解释的人工智能(AI)技术对模型预测进行解释。该实验从额叶、海马和颞叶分别揭示了 34、60 和 28 个 AD 目标生物标志物基因。ORA12 被鉴定为与 AD 进展强烈相关的三个区域的共同生物标志物。通路分析表明,STIM1 和 TRPC3 与 ORAI2 强烈相关。我们在 ORAI2 基因的网络中发现了三个枢纽基因 TPI1、STIM1 和 TRPC3,它们可能参与 AD 的分子发病机制。朴素贝叶斯分类器通过五折交叉验证以 100%的准确率对不同组的样本进行分类。AI 和 ML 是识别与疾病相关的基因的有前途的工具,这将推动针对遗传疾病的靶向治疗领域的发展。