• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预测中风发生:具有特征选择和数据预处理的堆叠机器学习方法。

Predicting stroke occurrences: a stacked machine learning approach with feature selection and data preprocessing.

机构信息

School of computer engineering, KIIT University, Patia, Bhubaneswar, Odisha, 751024, India.

Department of Environmental Health, Harvard T H Chan School of public Health, 677 Harrington Avenue, Boston, MA, 02115, USA.

出版信息

BMC Bioinformatics. 2024 Oct 15;25(1):329. doi: 10.1186/s12859-024-05866-8.

DOI:10.1186/s12859-024-05866-8
PMID:39407112
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11476080/
Abstract

Stroke prediction remains a critical area of research in healthcare, aiming to enhance early intervention and patient care strategies. This study investigates the efficacy of machine learning techniques, particularly principal component analysis (PCA) and a stacking ensemble method, for predicting stroke occurrences based on demographic, clinical, and lifestyle factors. We systematically varied PCA components and implemented a stacking model comprising random forest, decision tree, and K-nearest neighbors (KNN).Our findings demonstrate that setting PCA components to 16 optimally enhanced predictive accuracy, achieving a remarkable 98.6% accuracy in stroke prediction. Evaluation metrics underscored the robustness of our approach in handling class imbalance and improving model performance, also comparative analyses against traditional machine learning algorithms such as SVM, logistic regression, and Naive Bayes highlighted the superiority of our proposed method.

摘要

中风预测仍然是医疗保健领域的一个关键研究领域,旨在加强早期干预和患者护理策略。本研究调查了机器学习技术,特别是主成分分析(PCA)和堆叠集成方法,基于人口统计学、临床和生活方式因素预测中风发生的效果。我们系统地改变了 PCA 成分,并实施了一个堆叠模型,其中包括随机森林、决策树和 K-最近邻(KNN)。我们的研究结果表明,将 PCA 成分设置为 16 可以最佳地提高预测准确性,中风预测的准确性达到了惊人的 98.6%。评估指标强调了我们的方法在处理类别不平衡和提高模型性能方面的稳健性,与传统机器学习算法(如 SVM、逻辑回归和朴素贝叶斯)的比较分析也突出了我们提出的方法的优越性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/86065ccc6361/12859_2024_5866_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/96466e0ff8ba/12859_2024_5866_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/f96a37b263b7/12859_2024_5866_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/b37f04814c6b/12859_2024_5866_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/f7d3c56fc151/12859_2024_5866_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/4f3f5c78ed4d/12859_2024_5866_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/e0ff7ea4ccdb/12859_2024_5866_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/2493d88572fd/12859_2024_5866_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/adf8fef027ae/12859_2024_5866_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/14aaa2c1ff10/12859_2024_5866_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/8bca54b2c95e/12859_2024_5866_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/3206a5539f0f/12859_2024_5866_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/a29f2a42e2a6/12859_2024_5866_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/d3fe77c4a9d3/12859_2024_5866_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/6d17cbe8c233/12859_2024_5866_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/c31ba313b0d7/12859_2024_5866_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/6ca4beaf443d/12859_2024_5866_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/86065ccc6361/12859_2024_5866_Fig15_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/96466e0ff8ba/12859_2024_5866_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/f96a37b263b7/12859_2024_5866_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/b37f04814c6b/12859_2024_5866_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/f7d3c56fc151/12859_2024_5866_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/4f3f5c78ed4d/12859_2024_5866_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/e0ff7ea4ccdb/12859_2024_5866_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/2493d88572fd/12859_2024_5866_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/adf8fef027ae/12859_2024_5866_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/14aaa2c1ff10/12859_2024_5866_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/8bca54b2c95e/12859_2024_5866_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/3206a5539f0f/12859_2024_5866_Figb_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/a29f2a42e2a6/12859_2024_5866_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/d3fe77c4a9d3/12859_2024_5866_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/6d17cbe8c233/12859_2024_5866_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/c31ba313b0d7/12859_2024_5866_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/6ca4beaf443d/12859_2024_5866_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6cba/11476080/86065ccc6361/12859_2024_5866_Fig15_HTML.jpg

相似文献

1
Predicting stroke occurrences: a stacked machine learning approach with feature selection and data preprocessing.预测中风发生:具有特征选择和数据预处理的堆叠机器学习方法。
BMC Bioinformatics. 2024 Oct 15;25(1):329. doi: 10.1186/s12859-024-05866-8.
2
Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.用于预测埃塞俄比亚 COVID-19 死亡率的机器学习算法。
BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.
3
Unveiling the potential of machine learning approaches in predicting the emergence of stroke at its onset: a predicting framework.揭示机器学习方法在预测中风发病时出现的潜力:一个预测框架。
Sci Rep. 2024 Aug 29;14(1):20053. doi: 10.1038/s41598-024-70354-1.
4
Precision healthcare: A deep dive into machine learning algorithms and feature selection strategies for accurate heart disease prediction.精准医疗:深入探讨用于准确预测心脏病的机器学习算法和特征选择策略。
Comput Biol Med. 2024 Jun;176:108432. doi: 10.1016/j.compbiomed.2024.108432. Epub 2024 May 10.
5
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.
6
Prediction and feature selection of low birth weight using machine learning algorithms.利用机器学习算法预测和选择低出生体重。
J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.
7
A Risk Prediction Model for Physical Restraints Among Older Chinese Adults in Long-term Care Facilities: Machine Learning Study.长期护理机构中老年人身体约束的风险预测模型:机器学习研究。
J Med Internet Res. 2023 Apr 6;25:e43815. doi: 10.2196/43815.
8
Decision tree-based learning to predict patient controlled analgesia consumption and readjustment.基于决策树的学习预测患者自控镇痛消耗和调整。
BMC Med Inform Decis Mak. 2012 Nov 14;12:131. doi: 10.1186/1472-6947-12-131.
9
Machine learning models predict triage levels, massive transfusion protocol activation, and mortality in trauma utilizing patients hemodynamics on admission.机器学习模型利用创伤患者入院时的血流动力学来预测分诊级别、大量输血方案的激活和死亡率。
Comput Biol Med. 2024 Sep;179:108880. doi: 10.1016/j.compbiomed.2024.108880. Epub 2024 Jul 16.
10
Comparative assessment of the capability of machine learning-based radiomic models for predicting omental metastasis in locally advanced gastric cancer.基于机器学习的放射组学模型预测局部进展期胃癌网膜转移能力的比较评估。
Sci Rep. 2024 Jul 13;14(1):16208. doi: 10.1038/s41598-024-66979-x.

引用本文的文献

1
OptiSelect and EnShap: Integrating machine learning and game theory for ischemic stroke prediction.OptiSelect与EnShap:将机器学习与博弈论整合用于缺血性中风预测
PLoS One. 2025 Aug 13;20(8):e0328967. doi: 10.1371/journal.pone.0328967. eCollection 2025.
2
Developing an Explainable Prognostic Model for Acute Ischemic Stroke: Combining Clinical and Inflammatory Biomarkers With Machine Learning.开发一种用于急性缺血性中风的可解释性预后模型:将临床和炎症生物标志物与机器学习相结合。
Brain Behav. 2025 Aug;15(8):e70673. doi: 10.1002/brb3.70673.
3
Construction and Validation of a PCA-Based Prediction Model for Preterm Infant Stunting Risk: A Retrospective Study.

本文引用的文献

1
A Machine Learning Approach to Support Urgent Stroke Triage Using Administrative Data and Social Determinants of Health at Hospital Presentation: Retrospective Study.一种基于机器学习的方法,利用医院就诊时的行政数据和健康社会决定因素来支持紧急脑卒中分诊:回顾性研究。
J Med Internet Res. 2023 Jan 30;25:e36477. doi: 10.2196/36477.
2
Deep learning-based personalised outcome prediction after acute ischaemic stroke.基于深度学习的急性缺血性中风后个性化预后预测
J Neurol Neurosurg Psychiatry. 2023 May;94(5):369-378. doi: 10.1136/jnnp-2022-330230. Epub 2023 Jan 17.
3
Stroke Risk Prediction with Machine Learning Techniques.
基于主成分分析的早产儿发育迟缓风险预测模型的构建与验证:一项回顾性研究
Children (Basel). 2025 Apr 30;12(5):583. doi: 10.3390/children12050583.
基于机器学习技术的中风风险预测。
Sensors (Basel). 2022 Jun 21;22(13):4670. doi: 10.3390/s22134670.
4
Rapid triage for ischemic stroke: a machine learning-driven approach in the context of predictive, preventive and personalised medicine.缺血性中风的快速分诊:预测、预防和个性化医学背景下的机器学习驱动方法。
EPMA J. 2022 May 27;13(2):285-298. doi: 10.1007/s13167-022-00283-4. eCollection 2022 Jun.
5
Machine learning methods for functional recovery prediction and prognosis in post-stroke rehabilitation: a systematic review.机器学习方法在脑卒中康复中功能恢复预测和预后的应用:系统综述。
J Neuroeng Rehabil. 2022 Jun 3;19(1):54. doi: 10.1186/s12984-022-01032-4.
6
Predicting the Internal Knee Abduction Impulse During Walking Using Deep Learning.使用深度学习预测步行过程中的膝关节内收冲量
Front Bioeng Biotechnol. 2022 May 12;10:877347. doi: 10.3389/fbioe.2022.877347. eCollection 2022.
7
Risk prediction of 30-day mortality after stroke using machine learning: a nationwide registry-based cohort study.基于全国登记注册队列研究的机器学习预测卒中后 30 天死亡率。
BMC Neurol. 2022 May 27;22(1):195. doi: 10.1186/s12883-022-02722-1.
8
A comprehensive evaluation of state-of-the-art time-series deep learning models for activity-recognition in post-stroke rehabilitation assessment.全面评估最先进的时间序列深度学习模型在脑卒中后康复评估中的活动识别应用。
Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:2242-2247. doi: 10.1109/EMBC46164.2021.9630462.
9
Stroke risk prediction using machine learning: a prospective cohort study of 0.5 million Chinese adults.基于机器学习的卒中风险预测:一项 50 万中国成年人的前瞻性队列研究。
J Am Med Inform Assoc. 2021 Jul 30;28(8):1719-1727. doi: 10.1093/jamia/ocab068.
10
Prediction of Long-Term Stroke Recurrence Using Machine Learning Models.使用机器学习模型预测长期中风复发
J Clin Med. 2021 Mar 20;10(6):1286. doi: 10.3390/jcm10061286.