• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过集成学习和基于SHAP特征分析的特征可解释性优化阿尔茨海默病预测。

Optimizing Alzheimer's disease prediction through ensemble learning and feature interpretability with SHAP-based feature analysis.

作者信息

Hossain Md Kamrul, Ashraf Afrina, Islam Md Mominul, Sourav Shoriful Hassan, Shimul Md Monir Hossain

机构信息

Department of Computer Science and Engineering Daffodil International University Dhaka Bangladesh.

Department of Public Health Daffodil International University Dhaka Bangladesh.

出版信息

Alzheimers Dement (Amst). 2025 Aug 8;17(3):e70162. doi: 10.1002/dad2.70162. eCollection 2025 Jul-Sep.

DOI:10.1002/dad2.70162
PMID:40787633
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12333869/
Abstract

INTRODUCTION

Alzheimer's disease (AD) is a progressive neurodegenerative disorder and the leading cause of dementia. Early diagnosis is vital. We developed an interpretable machine learning (ML) model for early AD prediction using open clinical data.

METHODS

Data from 2149 adults (60-90 years) were obtained from Kaggle. After preprocessing and feature engineering, tree-based models were trained. A stacking ensemble model combining Gradient Boosting and XGBoost was trained, with Logistic Regression as the meta-learner. SHapley Additive exPlanations (SHAP) provided interpretability. Performance was measured by accuracy, precision, recall, F1 score, ROC and AUC.

RESULTS

The stacked ensemble achieved 97% accuracy (AUC 0.97), with 0.97 precision, 0.94 recall, and 0.96 F1 score for AD. SHAP identified memory complaints, Mini-Mental State Examination (MMSE), functional assessment, behavioral symptoms, cholesterol, and lifestyle factors (activity, diet, sleep) as top predictors.

CONCLUSION

The ensemble model, enhanced by SHAP analysis, provides accurate and interpretable AD risk predictions with potential applicability in future clinical decision support systems.

HIGHLIGHTS

Developed an ensemble machine learning (ML) model for early Alzheimer's disease (AD) prediction.Achieved 97% accuracy using stacked XGBoost and Gradient Boosting.SHapley Additive exPlanations (SHAP) analysis identified key cognitive and lifestyle-related risk factors.Model interprets AD risk using explainable artificial intelligence (AI) for clinical applicability.Utilized open-access dataset to ensure reproducibility and transparency.

摘要

引言

阿尔茨海默病(AD)是一种进行性神经退行性疾病,也是痴呆症的主要病因。早期诊断至关重要。我们利用公开的临床数据开发了一种可解释的机器学习(ML)模型,用于早期AD预测。

方法

从Kaggle获取了2149名成年人(60 - 90岁)的数据。经过预处理和特征工程后,训练基于树的模型。训练了一个结合梯度提升和XGBoost的堆叠集成模型,以逻辑回归作为元学习器。SHapley值加法解释(SHAP)提供可解释性。通过准确率、精确率、召回率、F1分数、ROC和AUC来衡量性能。

结果

堆叠集成模型的准确率达到97%(AUC为0.97),AD的精确率为0.97,召回率为0.94,F1分数为0.96。SHAP将记忆问题、简易精神状态检查表(MMSE)、功能评估、行为症状、胆固醇和生活方式因素(活动、饮食、睡眠)确定为主要预测因素。

结论

通过SHAP分析增强的集成模型提供了准确且可解释的AD风险预测,在未来临床决策支持系统中具有潜在的适用性。

要点

开发了一种用于早期阿尔茨海默病(AD)预测的集成机器学习(ML)模型。使用堆叠的XGBoost和梯度提升实现了97%的准确率。SHapley值加法解释(SHAP)分析确定了关键的认知和生活方式相关风险因素。该模型使用可解释人工智能(AI)解释AD风险以用于临床应用。利用开放获取数据集确保可重复性和透明度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d28e/12333869/9fe7a8087254/DAD2-17-e70162-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d28e/12333869/12235723c460/DAD2-17-e70162-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d28e/12333869/81977ed5c795/DAD2-17-e70162-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d28e/12333869/3abcb5dfad23/DAD2-17-e70162-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d28e/12333869/72b57efe9b2d/DAD2-17-e70162-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d28e/12333869/9fe7a8087254/DAD2-17-e70162-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d28e/12333869/12235723c460/DAD2-17-e70162-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d28e/12333869/81977ed5c795/DAD2-17-e70162-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d28e/12333869/3abcb5dfad23/DAD2-17-e70162-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d28e/12333869/72b57efe9b2d/DAD2-17-e70162-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d28e/12333869/9fe7a8087254/DAD2-17-e70162-g003.jpg

相似文献

1
Optimizing Alzheimer's disease prediction through ensemble learning and feature interpretability with SHAP-based feature analysis.通过集成学习和基于SHAP特征分析的特征可解释性优化阿尔茨海默病预测。
Alzheimers Dement (Amst). 2025 Aug 8;17(3):e70162. doi: 10.1002/dad2.70162. eCollection 2025 Jul-Sep.
2
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
3
Improving appendix cancer prediction with SHAP-based feature engineering for machine learning models: a prediction study.利用基于SHAP的机器学习模型特征工程改进阑尾癌预测:一项预测研究。
Ewha Med J. 2025 Apr;48(2):e31. doi: 10.12771/emj.2025.00297. Epub 2025 Apr 15.
4
Development of an explainable machine learning model for Alzheimer's disease prediction using clinical and behavioural features.利用临床和行为特征开发用于阿尔茨海默病预测的可解释机器学习模型。
MethodsX. 2025 Jul 7;15:103491. doi: 10.1016/j.mex.2025.103491. eCollection 2025 Dec.
5
Construction and validation of HBV-ACLF bacterial infection diagnosis model based on machine learning.基于机器学习的HBV-ACLF细菌感染诊断模型的构建与验证
BMC Infect Dis. 2025 Jul 1;25(1):847. doi: 10.1186/s12879-025-11199-5.
6
Development and validation of an explainable machine learning model for predicting postoperative pulmonary complications after lung cancer surgery: a machine learning study.用于预测肺癌手术后肺部并发症的可解释机器学习模型的开发与验证:一项机器学习研究
EClinicalMedicine. 2025 Aug 1;86:103386. doi: 10.1016/j.eclinm.2025.103386. eCollection 2025 Aug.
7
Interpretable prediction of hospital mortality in bleeding critically ill patients based on machine learning and SHAP.基于机器学习和SHAP对出血性危重症患者医院死亡率的可解释预测
BMC Med Inform Decis Mak. 2025 Jul 15;25(1):263. doi: 10.1186/s12911-025-03101-9.
8
Interpretable noninvasive diagnosis of tuberculous pleural effusion using LGBM and SHAP: development and clinical application of a machine learning model.使用LightGBM和SHAP对结核性胸腔积液进行可解释的无创诊断:机器学习模型的开发与临床应用
PeerJ. 2025 May 20;13:e19411. doi: 10.7717/peerj.19411. eCollection 2025.
9
Machine Learning Model for Predicting Coronary Heart Disease Risk: Development and Validation Using Insights From a Japanese Population-Based Study.预测冠心病风险的机器学习模型:基于日本人群研究的见解进行开发与验证
JMIR Cardio. 2025 May 12;9:e68066. doi: 10.2196/68066.
10
Prediction of Percutaneous Coronary Intervention Success in Patients With Moderate to Severe Coronary Artery Calcification Using Machine Learning Based on Coronary Angiography: Prospective Cohort Study.基于冠状动脉造影的机器学习预测中重度冠状动脉钙化患者经皮冠状动脉介入治疗的成功率:前瞻性队列研究
J Med Internet Res. 2025 Jul 11;27:e70943. doi: 10.2196/70943.

本文引用的文献

1
Improving Alzheimer's Disease Prediction with Different Machine Learning Approaches and Feature Selection Techniques.使用不同机器学习方法和特征选择技术改善阿尔茨海默病预测
Diagnostics (Basel). 2024 Oct 7;14(19):2237. doi: 10.3390/diagnostics14192237.
2
2024 Alzheimer's disease facts and figures.2024 年阿尔茨海默病事实和数据。
Alzheimers Dement. 2024 May;20(5):3708-3821. doi: 10.1002/alz.13809. Epub 2024 Apr 30.
3
An explainable machine learning approach for Alzheimer's disease classification.基于可解释机器学习的阿尔茨海默病分类方法。
Sci Rep. 2024 Feb 1;14(1):2637. doi: 10.1038/s41598-024-51985-w.
4
An explainable machine learning based prediction model for Alzheimer's disease in China longitudinal aging study.基于可解释机器学习的中国纵向衰老研究中阿尔茨海默病预测模型
Front Aging Neurosci. 2023 Nov 3;15:1267020. doi: 10.3389/fnagi.2023.1267020. eCollection 2023.
5
Explainable AI-based Alzheimer's prediction and management using multimodal data.基于可解释人工智能的多模态数据阿尔茨海默病预测与管理。
PLoS One. 2023 Nov 16;18(11):e0294253. doi: 10.1371/journal.pone.0294253. eCollection 2023.
6
Low- and High-Density Lipoprotein Cholesterol and Dementia Risk Over 17 Years of Follow-up Among Members of a Large Health Care Plan.在一项大型医疗保健计划成员中进行的长达 17 年的随访中,研究了低和高密度脂蛋白胆固醇与痴呆风险的关系。
Neurology. 2023 Nov 21;101(21):e2172-e2184. doi: 10.1212/WNL.0000000000207876. Epub 2023 Oct 4.
7
A Novel Approach Utilizing Machine Learning for the Early Diagnosis of Alzheimer's Disease.一种利用机器学习进行阿尔茨海默病早期诊断的新方法。
Biomed Mater Devices. 2023 Apr 10:1-17. doi: 10.1007/s44174-023-00078-9.
8
In-depth insights into Alzheimer's disease by using explainable machine learning approach.利用可解释的机器学习方法深入了解阿尔茨海默病。
Sci Rep. 2022 Apr 20;12(1):6508. doi: 10.1038/s41598-022-10202-2.
9
Early-Stage Alzheimer's Disease Prediction Using Machine Learning Models.使用机器学习模型预测早期阿尔茨海默病。
Front Public Health. 2022 Mar 3;10:853294. doi: 10.3389/fpubh.2022.853294. eCollection 2022.
10
Machine learning methods for predicting progression from mild cognitive impairment to Alzheimer's disease dementia: a systematic review.机器学习方法预测轻度认知障碍向阿尔茨海默病痴呆的进展:系统评价。
Alzheimers Res Ther. 2021 Sep 28;13(1):162. doi: 10.1186/s13195-021-00900-w.