Sun Dazhong, Peng Haojun, Wu Zhibing
The First Clinical Medical School, Guangzhou University of Chinese Medicine, Guangzhou, China.
Front Aging Neurosci. 2022 Jun 30;14:921906. doi: 10.3389/fnagi.2022.921906. eCollection 2022.
Alzheimer's disease (AD) is a neurodegenerative condition that causes cognitive decline over time. Because existing diagnostic approaches for AD are limited, improving upon previously established diagnostic models based on genetic biomarkers is necessary. Firstly, four AD gene expression datasets were collected from the Gene Expression Omnibus (GEO) database. Two datasets were used to establish diagnostic models, and the other two datasets were used to verify the model effect. We merged GSE5281 with GSE44771 as the training dataset and found 120 DEGs. Then, we used random forest (RF) to screen 6 key genes (KLF15, MAFF, ITPKB, SST, DDIT4, and NRXN3) as being critical for separating AD and normal samples. The weights of these key genes were measured, and a diagnostic model was created using an artificial neural network (ANN). The area under the curve (AUC) of the model is 0.953, while the accuracy is 0.914. In the final step, two validation datasets were utilized to assess AUC performance. In GSE109887, our model had an AUC of 0.854, and in GSE132903, it had an AUC of 0.810. To summarize, we successfully identified key gene biomarkers and developed a new AD diagnostic model.
阿尔茨海默病(AD)是一种神经退行性疾病,会随着时间的推移导致认知能力下降。由于现有的AD诊断方法有限,因此有必要改进基于基因生物标志物的先前建立的诊断模型。首先,从基因表达综合数据库(GEO)中收集了四个AD基因表达数据集。其中两个数据集用于建立诊断模型,另外两个数据集用于验证模型效果。我们将GSE5281与GSE44771合并作为训练数据集,共发现120个差异表达基因(DEG)。然后,我们使用随机森林(RF)筛选出6个关键基因(KLF15、MAFF、ITPKB、SST、DDIT4和NRXN3),这些基因对于区分AD样本和正常样本至关重要。测量了这些关键基因的权重,并使用人工神经网络(ANN)创建了一个诊断模型。该模型的曲线下面积(AUC)为0.953,准确率为0.914。在最后一步中,使用两个验证数据集评估AUC性能。在GSE109887中,我们的模型AUC为0.854,在GSE132903中,AUC为0.810。总之,我们成功鉴定了关键基因生物标志物并开发了一种新的AD诊断模型。