Fine tuned CatBoost machine learning approach for early detection of cardiovascular disease through predictive modeling.

Authors

Hamid Muhammad, Hajjej Fahima, Alluhaidan Ala Saleh, Bin Mannie Norah Waleed

Affiliations

Department of Computer Science, Government College Women University Sialkot, Sialkot, 51310, Pakistan.

Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, 11671, Saudi Arabia.

Publication information

Sci Rep. 2025 Aug 25;15(1):31199. doi: 10.1038/s41598-025-13790-x.

DOI:10.1038/s41598-025-13790-x
PMID:40854918
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12378338/

Abstract

Cardiovascular disease (CVD) remains one of the leading causes of morbidity and mortality worldwide, highlighting the urgent need for early-stage diagnosis to improve clinical outcomes. Machine learning (ML) approaches have demonstrated substantial potential in predictive modeling for CVD risk assessment. In this study, we propose an advanced predictive model based on the CatBoost algorithm to classify various stages of CVD using hospital records as the primary data source. The dataset, sourced from a publicly available repository, comprises 12 key predictor variables. The proposed methodology incorporates feature selection, rigorous validation processes, and data augmentation to enhance predictive performance and address the challenges associated with high-dimensional medical data. Among several ML algorithms evaluated, the fine-tuned CatBoost model achieved the highest performance, automating feature selection and facilitating the detection of early-stage heart disease. The model attained an impressive F1-score of 99% and an overall accuracy of 99.02%, outperforming existing ML-based approaches. These findings underscore the potential of the CatBoost algorithm for rapid and accurate CVD diagnosis, thereby supporting clinical decision-making. Future work will focus on external validation and testing on independent datasets to further assess the model's generalizability and clinical applicability.
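
The abstract reports an overall accuracy and an F1-score as its headline metrics. As a reminder of how these two numbers are derived from a set of predictions, here is a minimal sketch in plain Python. It assumes binary labels (1 = disease, 0 = no disease), which is a simplification, since the abstract describes classifying several stages of CVD. The function name `binary_metrics` and the toy labels are hypothetical and are not taken from the study.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute accuracy, precision, recall and F1 for binary labels (1 = positive)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Count the four cells of the confusion matrix.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    total = tp + tn + fp + fn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = (2 * precision * recall / (precision + recall)) if (precision + recall) else 0.0

    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Toy labels invented for illustration only; NOT data from the study.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]

metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

Accuracy counts all correct predictions, while F1 looks only at the positive class through precision and recall, so the two can diverge on imbalanced data. That is one reason reports of this kind usually quote both.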

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8587/12378338/6a1e66dd56dd/41598_2025_13790_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8587/12378338/006f4d13dc09/41598_2025_13790_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8587/12378338/5c5cbd464879/41598_2025_13790_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8587/12378338/c7d49065c692/41598_2025_13790_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8587/12378338/d51a92bbdf63/41598_2025_13790_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8587/12378338/40ac49a15cb8/41598_2025_13790_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8587/12378338/3ce6af44410f/41598_2025_13790_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8587/12378338/9e5527706f97/41598_2025_13790_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8587/12378338/580fbd50223b/41598_2025_13790_Fig9_HTML.jpg