• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习利用血液基因表达和临床数据进行阿尔茨海默病阶段诊断:一项比较研究。

Machine Learning-Based Alzheimer's Disease Stage Diagnosis Utilizing Blood Gene Expression and Clinical Data: A Comparative Investigation.

作者信息

Sarma Manash, Chatterjee Subarna

机构信息

Department of Computer Science and Engineering, Faculty of Engineering and Technology, Technology Campus (Peenya Campus), Ramaiah University of Applied Sciences, Bengaluru 560058, India.

出版信息

Diagnostics (Basel). 2025 Jan 17;15(2):211. doi: 10.3390/diagnostics15020211.

DOI:10.3390/diagnostics15020211
PMID:39857095
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11765009/
Abstract

This study presents a comparative analysis of the multistage diagnosis of Alzheimer's disease (AD), including mild cognitive impairment (MCI), utilizing two distinct types of biomarkers: blood gene expression and clinical biomarker samples. Both of these samples, obtained from participants in the Alzheimer's Disease Neuroimaging Initiative (ADNI), were independently analyzed utilizing machine learning (ML)-based multiclassifiers. This study applied novel machine learning-based data augmentation techniques to gene expression profile data that are high-dimensional, low-sample-size (HDLSS) and inherently highly imbalanced. The investigation obtained the highest multiclassification performance to date in the multistage diagnosis of Alzheimer's disease utilizing the blood gene expression profiles of Alzheimer's Disease Neuroimaging Initiative (ADNI) participants. Based on the performance results obtained, and other factors such as early prediction capabilities, this study compares the efficacies of the two types of biomarkers for multistage diagnosis. This study presents the sole investigation in which multiclassification-based AD stage diagnosis was conducted utilizing blood gene expression data. We obtained the best multiclassification result in both modalities of the ADNI data in terms of F1-score and were able to identify new genetic biomarkers. The combination of the XGBoost and SFBS (Sequential Floating Backward Selection) methods was used to select the features. We were able to select the 95 most effective gene probe sets out of 49,386. For the clinical study data, eight of the most effective biomarkers were selected using SFBS. A deep learning (DL) classifier was used to identify the stages-cognitive normal (CN), mild cognitive impairment (MCI), and Alzheimer's disease (AD)/dementia. DL, support vector machine (SVM), gradient boosting (GB), and random forest (RF) classifiers were used for the AD stage detection from gene expression profile data. Because of the high data imbalance in genomic data, borderline oversampling/data augmentation was applied in the model training and original samples for validation. Utilizing clinical data, the highest ROC AUC scores attained were 0.989, 0.927, and 0.907 for the identification of the CN, MCI, and dementia stages, respectively. The highest F1 scores achieved were 0.971, 0.939, and 0.886. Employing gene expression data, we obtained ROC AUC scores of 0.763, 0.761, and 0.706 for the CN, MCI, and dementia stages, respectively, and F1 scores of 0.71, 0.77, and 0.53 for CN, MCI, and dementia, respectively. This represents the best outcome to date for AD stage diagnosis from ADNI blood gene expression profile data utilizing multiclassification techniques. The results indicated that our multiclassification model effectively manages the imbalanced data of a high-dimension, low-sample-size (HDLSS) nature to identify samples of the minority class. MAPK14, PLG, FZD2, FXYD6, and TEP1 are among the novel genes identified as being associated with AD risk.

摘要

本研究对阿尔茨海默病(AD)的多阶段诊断进行了比较分析,包括轻度认知障碍(MCI),使用了两种不同类型的生物标志物:血液基因表达和临床生物标志物样本。这两种样本均取自阿尔茨海默病神经影像倡议(ADNI)的参与者,并利用基于机器学习(ML)的多分类器进行独立分析。本研究将基于机器学习的新型数据增强技术应用于高维、低样本量(HDLSS)且本质上高度不平衡的基因表达谱数据。该研究利用阿尔茨海默病神经影像倡议(ADNI)参与者的血液基因表达谱,在阿尔茨海默病的多阶段诊断中取得了迄今为止最高的多分类性能。基于所获得的性能结果以及早期预测能力等其他因素,本研究比较了这两种类型生物标志物在多阶段诊断中的有效性。本研究是唯一一项利用血液基因表达数据进行基于多分类的AD阶段诊断的调查。在F1分数方面,我们在ADNI数据的两种模式中均获得了最佳的多分类结果,并且能够识别新的遗传生物标志物。使用XGBoost和SFBS(顺序浮动后向选择)方法的组合来选择特征。我们能够从49386个基因探针集中选出95个最有效的基因探针集。对于临床研究数据,使用SFBS选择了8个最有效的生物标志物。使用深度学习(DL)分类器来识别认知正常(CN)、轻度认知障碍(MCI)和阿尔茨海默病(AD)/痴呆阶段。使用DL、支持向量机(SVM)、梯度提升(GB)和随机森林(RF)分类器从基因表达谱数据中进行AD阶段检测。由于基因组数据中存在高度的数据不平衡,在模型训练和用于验证的原始样本中应用了边界过采样/数据增强。利用临床数据,识别CN、MCI和痴呆阶段时获得的最高ROC AUC分数分别为0.989、0.927和0.907。获得的最高F1分数分别为0.971、0.939和0.886。利用基因表达数据,CN、MCI和痴呆阶段的ROC AUC分数分别为0.763、0.761和0.706,CN、MCI和痴呆的F1分数分别为0.71、0.77和0.53。这代表了利用多分类技术从ADNI血液基因表达谱数据进行AD阶段诊断的迄今为止的最佳结果。结果表明,我们的多分类模型有效地管理了高维、低样本量(HDLSS)性质的不平衡数据,以识别少数类别的样本。MAPK14、PLG、FZD2、FXYD6和TEP1是被确定与AD风险相关的新基因。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/78bfcc6e6631/diagnostics-15-00211-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/93e8ed741eeb/diagnostics-15-00211-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/99cb7fef459a/diagnostics-15-00211-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/7e41463269e7/diagnostics-15-00211-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/a5fcfd4c9a61/diagnostics-15-00211-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/008c5d8a47b2/diagnostics-15-00211-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/4dc2db6d6669/diagnostics-15-00211-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/5bf6aa1ff447/diagnostics-15-00211-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/6bf211a8f485/diagnostics-15-00211-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/0811290ee642/diagnostics-15-00211-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/a6c2a69e2dce/diagnostics-15-00211-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/7e2fb040e86b/diagnostics-15-00211-g011a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/1b1f7c0330ea/diagnostics-15-00211-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/78bfcc6e6631/diagnostics-15-00211-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/93e8ed741eeb/diagnostics-15-00211-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/99cb7fef459a/diagnostics-15-00211-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/7e41463269e7/diagnostics-15-00211-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/a5fcfd4c9a61/diagnostics-15-00211-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/008c5d8a47b2/diagnostics-15-00211-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/4dc2db6d6669/diagnostics-15-00211-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/5bf6aa1ff447/diagnostics-15-00211-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/6bf211a8f485/diagnostics-15-00211-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/0811290ee642/diagnostics-15-00211-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/a6c2a69e2dce/diagnostics-15-00211-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/7e2fb040e86b/diagnostics-15-00211-g011a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/1b1f7c0330ea/diagnostics-15-00211-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d014/11765009/78bfcc6e6631/diagnostics-15-00211-g013.jpg

相似文献

1
Machine Learning-Based Alzheimer's Disease Stage Diagnosis Utilizing Blood Gene Expression and Clinical Data: A Comparative Investigation.基于机器学习利用血液基因表达和临床数据进行阿尔茨海默病阶段诊断:一项比较研究。
Diagnostics (Basel). 2025 Jan 17;15(2):211. doi: 10.3390/diagnostics15020211.
2
XGBoost-SHAP-based interpretable diagnostic framework for alzheimer's disease.基于 XGBoost-SHAP 的阿尔茨海默病可解释诊断框架。
BMC Med Inform Decis Mak. 2023 Jul 25;23(1):137. doi: 10.1186/s12911-023-02238-9.
3
Graph Convolutional Network for AD and MCI Diagnosis Utilizing Peripheral DNA Methylation: Réseau de neurones en graphes pour le diagnostic de la MA et du TCL à l'aide de la méthylation de l'ADN périphérique.利用外周血DNA甲基化的阿尔茨海默病和轻度认知障碍诊断的图卷积网络:使用外周血DNA甲基化进行阿尔茨海默病和轻度认知障碍诊断的图神经网络
Can J Psychiatry. 2024 Dec;69(12):869-879. doi: 10.1177/07067437241300947. Epub 2024 Nov 25.
4
Comparing different algorithms for the course of Alzheimer's disease using machine learning.使用机器学习比较阿尔茨海默病病程的不同算法。
Ann Palliat Med. 2021 Sep;10(9):9715-9724. doi: 10.21037/apm-21-2013.
5
c-Diadem: a constrained dual-input deep learning model to identify novel biomarkers in Alzheimer's disease.c-Diadem:一种受限双输入深度学习模型,用于识别阿尔茨海默病中的新型生物标志物。
BMC Med Genomics. 2023 Oct 13;16(Suppl 2):244. doi: 10.1186/s12920-023-01675-9.
6
MRI-based mild cognitive impairment and Alzheimer's disease classification using an algorithm of combination of variational autoencoder and other machine learning classifiers.基于磁共振成像的轻度认知障碍和阿尔茨海默病分类:使用变分自编码器与其他机器学习分类器相结合的算法
J Alzheimers Dis Rep. 2024 Oct 18;8(1):1434-1452. doi: 10.1177/25424823241290694. eCollection 2024.
7
A machine learning approach for identifying anatomical biomarkers of early mild cognitive impairment.一种用于识别早期轻度认知障碍解剖生物标志物的机器学习方法。
PeerJ. 2024 Dec 13;12:e18490. doi: 10.7717/peerj.18490. eCollection 2024.
8
Machine learning framework for early MRI-based Alzheimer's conversion prediction in MCI subjects.用于基于磁共振成像(MRI)早期预测轻度认知障碍(MCI)患者向阿尔茨海默病转化的机器学习框架。
Neuroimage. 2015 Jan 1;104:398-412. doi: 10.1016/j.neuroimage.2014.10.002. Epub 2014 Oct 12.
9
Cross-cohort generalizability of deep and conventional machine learning for MRI-based diagnosis and prediction of Alzheimer's disease.基于 MRI 的阿尔茨海默病诊断和预测的深度学习和传统机器学习在不同队列间的泛化能力。
Neuroimage Clin. 2021;31:102712. doi: 10.1016/j.nicl.2021.102712. Epub 2021 Jun 4.
10
ApoE4 effects on automated diagnostic classifiers for mild cognitive impairment and Alzheimer's disease.载脂蛋白E4对轻度认知障碍和阿尔茨海默病自动诊断分类器的影响。
Neuroimage Clin. 2014 Jan 4;4:461-72. doi: 10.1016/j.nicl.2013.12.012. eCollection 2014.

本文引用的文献

1
Etiology of Late-Onset Alzheimer's Disease, Biomarker Efficacy, and the Role of Machine Learning in Stage Diagnosis.迟发性阿尔茨海默病的病因、生物标志物功效及机器学习在阶段诊断中的作用
Diagnostics (Basel). 2024 Nov 23;14(23):2640. doi: 10.3390/diagnostics14232640.
2
Predicting early Alzheimer's with blood biomarkers and clinical features.利用血液生物标志物和临床特征预测早期阿尔茨海默病。
Sci Rep. 2024 Mar 13;14(1):6039. doi: 10.1038/s41598-024-56489-1.
3
A Deep Learning-Based Ensemble Method for Early Diagnosis of Alzheimer's Disease using MRI Images.
基于深度学习的基于 MRI 图像的阿尔茨海默病早期诊断的集成方法。
Neuroinformatics. 2024 Jan;22(1):89-105. doi: 10.1007/s12021-023-09646-2. Epub 2023 Dec 2.
4
Prediction of Alzheimer's Disease by a Novel Image-Based Representation of Gene Expression.基于基因表达的新型图像表示对阿尔茨海默病的预测。
Genes (Basel). 2022 Aug 8;13(8):1406. doi: 10.3390/genes13081406.
5
Clinical Functional Genomics.临床功能基因组学
Cancers (Basel). 2021 Sep 15;13(18):4627. doi: 10.3390/cancers13184627.
6
Identification of Alzheimer's disease using a convolutional neural network model based on T1-weighted magnetic resonance imaging.基于 T1 加权磁共振成像的卷积神经网络模型识别阿尔茨海默病。
Sci Rep. 2020 Dec 17;10(1):22252. doi: 10.1038/s41598-020-79243-9.
7
Prediction of Alzheimer's disease using blood gene expression data.利用血液基因表达数据预测阿尔茨海默病。
Sci Rep. 2020 Feb 26;10(1):3485. doi: 10.1038/s41598-020-60595-1.
8
Classification of Alzheimer's Disease with and without Imagery using Gradient Boosted Machines and ResNet-50.使用梯度提升机和ResNet-50对有无图像的阿尔茨海默病进行分类。
Brain Sci. 2019 Aug 22;9(9):212. doi: 10.3390/brainsci9090212.
9
A Meta-Analysis of Alzheimer's Disease Brain Transcriptomic Data.阿尔茨海默病脑转录组数据的荟萃分析。
J Alzheimers Dis. 2019;68(4):1635-1656. doi: 10.3233/JAD-181085.
10
Revealing Alzheimer's disease genes spectrum in the whole-genome by machine learning.通过机器学习揭示全基因组中的阿尔茨海默病基因谱。
BMC Neurol. 2018 Jan 10;18(1):5. doi: 10.1186/s12883-017-1010-3.