• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于增强中风预测的集成机器学习和数据挖掘方法。

An Ensemble Machine Learning and Data Mining Approach to Enhance Stroke Prediction.

作者信息

Wijaya Richard, Saeed Faisal, Samimi Parnia, Albarrak Abdullah M, Qasem Sultan Noman

机构信息

College of Computing and Digital Technology, Birmingham City University, Birmingham B4 7XG, UK.

Computer Science Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh 11432, Saudi Arabia.

出版信息

Bioengineering (Basel). 2024 Jul 2;11(7):672. doi: 10.3390/bioengineering11070672.

DOI:10.3390/bioengineering11070672
PMID:39061754
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11274138/
Abstract

Stroke poses a significant health threat, affecting millions annually. Early and precise prediction is crucial to providing effective preventive healthcare interventions. This study applied an ensemble machine learning and data mining approach to enhance the effectiveness of stroke prediction. By employing the cross-industry standard process for data mining (CRISP-DM) methodology, various techniques, including random forest, ExtraTrees, XGBoost, artificial neural network (ANN), and genetic algorithm with ANN (GANN) were applied on two benchmark datasets to predict stroke based on several parameters, such as gender, age, various diseases, smoking status, BMI, HighCol, physical activity, hypertension, heart disease, lifestyle, and others. Due to dataset imbalance, Synthetic Minority Oversampling Technique (SMOTE) was applied to the datasets. Hyperparameter tuning optimized the models via grid search and randomized search cross-validation. The evaluation metrics included accuracy, precision, recall, F1-score, and area under the curve (AUC). The experimental results show that the ensemble ExtraTrees classifier achieved the highest accuracy (98.24%) and AUC (98.24%). Random forest also performed well, achieving 98.03% in both accuracy and AUC. Comparisons with state-of-the-art stroke prediction methods revealed that the proposed approach demonstrates superior performance, indicating its potential as a promising method for stroke prediction and offering substantial benefits to healthcare.

摘要

中风对健康构成重大威胁,每年影响数百万人。早期准确预测对于提供有效的预防性医疗干预至关重要。本研究应用集成机器学习和数据挖掘方法来提高中风预测的有效性。通过采用跨行业数据挖掘标准流程(CRISP-DM)方法,将包括随机森林、极端随机树、XGBoost、人工神经网络(ANN)以及带有ANN的遗传算法(GANN)等各种技术应用于两个基准数据集,以基于性别、年龄、各种疾病、吸烟状况、体重指数、总胆固醇、身体活动、高血压、心脏病、生活方式等多个参数预测中风。由于数据集不平衡,对数据集应用了合成少数过采样技术(SMOTE)。超参数调整通过网格搜索和随机搜索交叉验证对模型进行了优化。评估指标包括准确率、精确率、召回率、F1分数和曲线下面积(AUC)。实验结果表明,集成极端随机树分类器实现了最高准确率(98.24%)和AUC(98.24%)。随机森林也表现良好,准确率和AUC均达到98.03%。与现有最先进的中风预测方法的比较表明,所提出的方法表现出卓越性能,表明其作为一种有前景的中风预测方法的潜力,并为医疗保健带来巨大益处。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/6d67ea3c9c9d/bioengineering-11-00672-g022.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/0acbdd46beab/bioengineering-11-00672-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/4abadc3650e9/bioengineering-11-00672-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/6013bfc7fb98/bioengineering-11-00672-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/d7c7e0a0e379/bioengineering-11-00672-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/5a4ade689ca8/bioengineering-11-00672-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/c8d3f3e3b974/bioengineering-11-00672-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/5852a4cbc65e/bioengineering-11-00672-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/de8f6609430a/bioengineering-11-00672-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/11115cd1382b/bioengineering-11-00672-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/9f9d659b4431/bioengineering-11-00672-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/79634e2d7ffe/bioengineering-11-00672-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/ac7ea4d84c2d/bioengineering-11-00672-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/238f4b2f62e4/bioengineering-11-00672-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/ba797189adc5/bioengineering-11-00672-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/4a7d247dcf16/bioengineering-11-00672-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/4de07d18b4bf/bioengineering-11-00672-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/d229d89b1371/bioengineering-11-00672-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/223ea6da57a5/bioengineering-11-00672-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/df1467e6b70b/bioengineering-11-00672-g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/b0f8e5714e7c/bioengineering-11-00672-g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/77c951b4a887/bioengineering-11-00672-g021.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/6d67ea3c9c9d/bioengineering-11-00672-g022.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/0acbdd46beab/bioengineering-11-00672-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/4abadc3650e9/bioengineering-11-00672-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/6013bfc7fb98/bioengineering-11-00672-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/d7c7e0a0e379/bioengineering-11-00672-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/5a4ade689ca8/bioengineering-11-00672-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/c8d3f3e3b974/bioengineering-11-00672-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/5852a4cbc65e/bioengineering-11-00672-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/de8f6609430a/bioengineering-11-00672-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/11115cd1382b/bioengineering-11-00672-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/9f9d659b4431/bioengineering-11-00672-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/79634e2d7ffe/bioengineering-11-00672-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/ac7ea4d84c2d/bioengineering-11-00672-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/238f4b2f62e4/bioengineering-11-00672-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/ba797189adc5/bioengineering-11-00672-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/4a7d247dcf16/bioengineering-11-00672-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/4de07d18b4bf/bioengineering-11-00672-g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/d229d89b1371/bioengineering-11-00672-g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/223ea6da57a5/bioengineering-11-00672-g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/df1467e6b70b/bioengineering-11-00672-g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/b0f8e5714e7c/bioengineering-11-00672-g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/77c951b4a887/bioengineering-11-00672-g021.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/caa9/11274138/6d67ea3c9c9d/bioengineering-11-00672-g022.jpg

相似文献

1
An Ensemble Machine Learning and Data Mining Approach to Enhance Stroke Prediction.一种用于增强中风预测的集成机器学习和数据挖掘方法。
Bioengineering (Basel). 2024 Jul 2;11(7):672. doi: 10.3390/bioengineering11070672.
2
RDET stacking classifier: a novel machine learning based approach for stroke prediction using imbalance data.RDET堆叠分类器:一种基于机器学习的新型方法,用于利用不平衡数据进行中风预测。
PeerJ Comput Sci. 2023 Nov 21;9:e1684. doi: 10.7717/peerj-cs.1684. eCollection 2023.
3
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.
4
Enhanced osteoporotic fracture prediction in postmenopausal women using Bayesian optimization of machine learning models with genetic risk score.使用具有遗传风险评分的机器学习模型的贝叶斯优化增强绝经后女性骨质疏松性骨折预测
J Bone Miner Res. 2024 May 2;39(4):462-472. doi: 10.1093/jbmr/zjae025.
5
Development of an efficient novel method for coronary artery disease prediction using machine learning and deep learning techniques.利用机器学习和深度学习技术开发一种用于冠心病预测的高效新方法。
Technol Health Care. 2024;32(6):4545-4569. doi: 10.3233/THC-240740.
6
Prediction of diabetes disease using an ensemble of machine learning multi-classifier models.使用机器学习多分类器集成模型预测糖尿病疾病。
BMC Bioinformatics. 2023 Sep 12;24(1):337. doi: 10.1186/s12859-023-05465-z.
7
Predicting sexually transmitted infections among men who have sex with men in Zimbabwe using deep learning and ensemble machine learning models.使用深度学习和集成机器学习模型预测津巴布韦男男性行为者中的性传播感染情况。
PLOS Digit Health. 2024 Jul 3;3(7):e0000541. doi: 10.1371/journal.pdig.0000541. eCollection 2024 Jul.
8
A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system.一种新的混合集成机器学习模型,用于严重程度风险评估和 COVID 后预测系统。
Math Biosci Eng. 2022 Apr 13;19(6):6102-6123. doi: 10.3934/mbe.2022285.
9
Development and validation of a web-based artificial intelligence prediction model to assess massive intraoperative blood loss for metastatic spinal disease using machine learning techniques.开发和验证一种基于网络的人工智能预测模型,使用机器学习技术评估转移性脊柱疾病的术中大量失血。
Spine J. 2024 Jan;24(1):146-160. doi: 10.1016/j.spinee.2023.09.001. Epub 2023 Sep 11.
10
Establishment and validation of an interactive artificial intelligence platform to predict postoperative ambulatory status for patients with metastatic spinal disease: a multicenter analysis.建立和验证交互式人工智能平台,以预测转移性脊柱疾病患者的术后活动状态:一项多中心分析。
Int J Surg. 2024 May 1;110(5):2738-2756. doi: 10.1097/JS9.0000000000001169.

引用本文的文献

1
Comparison of machine learning models for predicting stroke risk in hypertensive patients: Lasso regression model, random forest model, Boruta algorithm model, and Boruta algorithm combined with Lasso regression model.预测高血压患者中风风险的机器学习模型比较:套索回归模型、随机森林模型、Boruta算法模型以及Boruta算法与套索回归模型相结合的模型。
Medicine (Baltimore). 2025 May 30;104(22):e42690. doi: 10.1097/MD.0000000000042690.

本文引用的文献

1
Real World-Big Data Analytics in Healthcare.真实世界-医疗保健中的大数据分析。
Int J Environ Res Public Health. 2022 Sep 16;19(18):11677. doi: 10.3390/ijerph191811677.
2
Stroke Risk Prediction with Machine Learning Techniques.基于机器学习技术的中风风险预测。
Sensors (Basel). 2022 Jun 21;22(13):4670. doi: 10.3390/s22134670.
3
Prediction of 3-year risk of diabetic kidney disease using machine learning based on electronic medical records.基于电子病历的机器学习预测糖尿病肾病 3 年风险。
J Transl Med. 2022 Mar 26;20(1):143. doi: 10.1186/s12967-022-03339-1.
4
COVID-19 image classification using deep learning: Advances, challenges and opportunities.基于深度学习的 COVID-19 图像分类:进展、挑战与机遇。
Comput Biol Med. 2022 May;144:105350. doi: 10.1016/j.compbiomed.2022.105350. Epub 2022 Mar 3.
5
Prediction of diabetic kidney disease with machine learning algorithms, upon the initial diagnosis of type 2 diabetes mellitus.基于机器学习算法的 2 型糖尿病初始诊断时的糖尿病肾病预测。
BMJ Open Diabetes Res Care. 2022 Jan;10(1). doi: 10.1136/bmjdrc-2021-002560.
6
Deep Learning-Based Stroke Disease Prediction System Using Real-Time Bio Signals.基于深度学习的实时生物信号脑卒中疾病预测系统。
Sensors (Basel). 2021 Jun 22;21(13):4269. doi: 10.3390/s21134269.
7
Stroke Prediction with Machine Learning Methods among Older Chinese.基于机器学习方法对中国老年人进行中风预测。
Int J Environ Res Public Health. 2020 Mar 12;17(6):1828. doi: 10.3390/ijerph17061828.
8
Values, challenges and future directions of big data analytics in healthcare: A systematic review.大数据分析在医疗保健中的价值、挑战和未来方向:系统评价。
Soc Sci Med. 2019 Nov;241:112533. doi: 10.1016/j.socscimed.2019.112533. Epub 2019 Sep 10.
9
The Use of Deep Learning to Predict Stroke Patient Mortality.深度学习在预测脑卒中患者死亡率中的应用。
Int J Environ Res Public Health. 2019 May 28;16(11):1876. doi: 10.3390/ijerph16111876.
10
Prediction of fatty liver disease using machine learning algorithms.使用机器学习算法预测脂肪肝疾病。
Comput Methods Programs Biomed. 2019 Mar;170:23-29. doi: 10.1016/j.cmpb.2018.12.032. Epub 2018 Dec 29.