• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于血常规和生化检测数据构建心血管多病种的机器学习诊断模型。

Construction of machine learning diagnostic models for cardiovascular pan-disease based on blood routine and biochemical detection data.

机构信息

Institute for Clinical Medical Research, School of Medicine, The First Affiliated Hospital of Xiamen University, Xiamen University, Xiamen, 361003, Fujian, China.

Department of Laboratory Medicine, Xiamen Key Laboratory of Genetic Testing, School of Medicine, the First Affiliated Hospital of Xiamen University, Xiamen University, Xiamen, 361003, Fujian, China.

出版信息

Cardiovasc Diabetol. 2024 Sep 28;23(1):351. doi: 10.1186/s12933-024-02439-0.

DOI:10.1186/s12933-024-02439-0
PMID:39342281
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11439295/
Abstract

BACKGROUND

Cardiovascular disease, also known as circulation system disease, remains the leading cause of morbidity and mortality worldwide. Traditional methods for diagnosing cardiovascular disease are often expensive and time-consuming. So the purpose of this study is to construct machine learning models for the diagnosis of cardiovascular diseases using easily accessible blood routine and biochemical detection data and explore the unique hematologic features of cardiovascular diseases, including some metabolic indicators.

METHODS

After the data preprocessing, 25,794 healthy people and 32,822 circulation system disease patients with the blood routine and biochemical detection data were utilized for our study. We selected logistic regression, random forest, support vector machine, eXtreme Gradient Boosting (XGBoost), and deep neural network to construct models. Finally, the SHAP algorithm was used to interpret models.

RESULTS

The circulation system disease prediction model constructed by XGBoost possessed the best performance (AUC: 0.9921 (0.9911-0.9930); Acc: 0.9618 (0.9588-0.9645); Sn: 0.9690 (0.9655-0.9723); Sp: 0.9526 (0.9477-0.9572); PPV: 0.9631 (0.9592-0.9668); NPV: 0.9600 (0.9556-0.9644); MCC: 0.9224 (0.9165-0.9279); F1 score: 0.9661 (0.9634-0.9686)). Most models of distinguishing various circulation system diseases also had good performance, the model performance of distinguishing dilated cardiomyopathy from other circulation system diseases was the best (AUC: 0.9267 (0.8663-0.9752)). The model interpretation by the SHAP algorithm indicated features from biochemical detection made major contributions to predicting circulation system disease, such as potassium (K), total protein (TP), albumin (ALB), and indirect bilirubin (NBIL). But for models of distinguishing various circulation system diseases, we found that red blood cell count (RBC), K, direct bilirubin (DBIL), and glucose (GLU) were the top 4 features subdividing various circulation system diseases.

CONCLUSIONS

The present study constructed multiple models using 50 features from the blood routine and biochemical detection data for the diagnosis of various circulation system diseases. At the same time, the unique hematologic features of various circulation system diseases, including some metabolic-related indicators, were also explored. This cost-effective work will benefit more people and help diagnose and prevent circulation system diseases.

摘要

背景

心血管疾病,又称循环系统疾病,仍是全球发病率和死亡率的主要原因。传统的心血管疾病诊断方法往往既昂贵又耗时。因此,本研究旨在使用易于获得的血常规和生化检测数据构建用于心血管疾病诊断的机器学习模型,并探索心血管疾病的独特血液学特征,包括一些代谢指标。

方法

在数据预处理后,我们使用了 25794 名健康人和 32822 名患有循环系统疾病的患者的血常规和生化检测数据。我们选择了逻辑回归、随机森林、支持向量机、极端梯度提升(XGBoost)和深度神经网络来构建模型。最后,我们使用 SHAP 算法对模型进行解释。

结果

XGBoost 构建的循环系统疾病预测模型表现最佳(AUC:0.9921(0.9911-0.9930);Acc:0.9618(0.9588-0.9645);Sn:0.9690(0.9655-0.9723);Sp:0.9526(0.9477-0.9572);PPV:0.9631(0.9592-0.9668);NPV:0.9600(0.9556-0.9644);MCC:0.9224(0.9165-0.9279);F1 分数:0.9661(0.9634-0.9686))。区分各种循环系统疾病的大多数模型也表现良好,区分扩张型心肌病与其他循环系统疾病的模型性能最佳(AUC:0.9267(0.8663-0.9752))。SHAP 算法的模型解释表明,生化检测中的特征对预测循环系统疾病有重要贡献,如钾(K)、总蛋白(TP)、白蛋白(ALB)和间接胆红素(NBIL)。但对于区分各种循环系统疾病的模型,我们发现红细胞计数(RBC)、K、直接胆红素(DBIL)和葡萄糖(GLU)是区分各种循环系统疾病的前 4 个特征。

结论

本研究使用血常规和生化检测数据中的 50 个特征构建了多个用于诊断各种循环系统疾病的模型。同时,还探索了各种循环系统疾病的独特血液学特征,包括一些代谢相关指标。这项具有成本效益的工作将使更多的人受益,并有助于诊断和预防循环系统疾病。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16e0/11439295/6b8d24daace2/12933_2024_2439_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16e0/11439295/2a76472d6f55/12933_2024_2439_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16e0/11439295/f11a70d7186f/12933_2024_2439_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16e0/11439295/642a5d973d26/12933_2024_2439_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16e0/11439295/c91d27dc67d4/12933_2024_2439_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16e0/11439295/6b8d24daace2/12933_2024_2439_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16e0/11439295/2a76472d6f55/12933_2024_2439_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16e0/11439295/f11a70d7186f/12933_2024_2439_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16e0/11439295/642a5d973d26/12933_2024_2439_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16e0/11439295/c91d27dc67d4/12933_2024_2439_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16e0/11439295/6b8d24daace2/12933_2024_2439_Fig5_HTML.jpg

相似文献

1
Construction of machine learning diagnostic models for cardiovascular pan-disease based on blood routine and biochemical detection data.基于血常规和生化检测数据构建心血管多病种的机器学习诊断模型。
Cardiovasc Diabetol. 2024 Sep 28;23(1):351. doi: 10.1186/s12933-024-02439-0.
2
[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].基于监督机器学习算法构建脓毒症休克患者死亡风险预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.
3
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
4
Predicting major adverse cardiac events in diabetes and chronic kidney disease: a machine learning study from the Silesia Diabetes-Heart Project.预测糖尿病和慢性肾脏病患者的主要不良心脏事件:西里西亚糖尿病-心脏项目的一项机器学习研究
Cardiovasc Diabetol. 2025 Feb 15;24(1):76. doi: 10.1186/s12933-025-02615-w.
5
Optimizing heart disease diagnosis with advanced machine learning models: a comparison of predictive performance.使用先进机器学习模型优化心脏病诊断:预测性能比较
BMC Cardiovasc Disord. 2025 Mar 22;25(1):212. doi: 10.1186/s12872-025-04627-6.
6
Predicting cerebral infarction and transient ischemic attack in healthy individuals and those with dysmetabolism: a machine learning approach combined with routine blood tests.预测健康个体和代谢异常者的脑梗死和短暂性脑缺血发作:一种结合常规血液检测的机器学习方法
Sci Rep. 2025 Apr 16;15(1):13044. doi: 10.1038/s41598-025-94682-y.
7
Machine learning-based predictive models for perioperative major adverse cardiovascular events in patients with stable coronary artery disease undergoing noncardiac surgery.基于机器学习的预测模型用于接受非心脏手术的稳定冠状动脉疾病患者围手术期主要不良心血管事件的预测
Comput Methods Programs Biomed. 2025 Mar;260:108561. doi: 10.1016/j.cmpb.2024.108561. Epub 2024 Dec 13.
8
Identifying cardiovascular disease risk in the U.S. population using environmental volatile organic compounds exposure: A machine learning predictive model based on the SHAP methodology.利用环境挥发性有机化合物暴露识别美国人群心血管疾病风险:基于 SHAP 方法的机器学习预测模型。
Ecotoxicol Environ Saf. 2024 Nov 1;286:117210. doi: 10.1016/j.ecoenv.2024.117210. Epub 2024 Oct 23.
9
Application of machine learning model in predicting the likelihood of blood transfusion after hip fracture surgery.机器学习模型在预测髋部骨折手术后输血可能性中的应用。
Aging Clin Exp Res. 2023 Nov;35(11):2643-2656. doi: 10.1007/s40520-023-02550-4. Epub 2023 Sep 21.
10
Machine learning algorithms for diabetic kidney disease risk predictive model of Chinese patients with type 2 diabetes mellitus.用于中国2型糖尿病患者糖尿病肾病风险预测模型的机器学习算法
Ren Fail. 2025 Dec;47(1):2486558. doi: 10.1080/0886022X.2025.2486558. Epub 2025 Apr 7.

引用本文的文献

1
Comparative analysis of machine learning algorithms for predicting tibial intramedullary nail length from patient characteristics.基于患者特征预测胫骨髓内钉长度的机器学习算法比较分析
J Orthop Traumatol. 2025 Aug 18;26(1):56. doi: 10.1186/s10195-025-00869-4.
2
Model Prediction of Infection Risk After Glucocorticoid Treatment for Severe Drug-Induced Liver Injury: Globulin is a Core Predictive Factor.重症药物性肝损伤糖皮质激素治疗后感染风险的模型预测:球蛋白是核心预测因素。
Drug Des Devel Ther. 2025 Aug 11;19:6871-6883. doi: 10.2147/DDDT.S532870. eCollection 2025.
3
Development and Validation of An Interpretable Machine Learning-Based Prediction Model of Postpartum Hemorrhage in Placenta Previa Following Cesarean Section: A Multicenter Study.

本文引用的文献

1
Screening and diagnosis of cardiovascular disease using artificial intelligence-enabled cardiac magnetic resonance imaging.人工智能赋能心脏磁共振成像在心血管疾病筛查和诊断中的应用。
Nat Med. 2024 May;30(5):1471-1480. doi: 10.1038/s41591-024-02971-2. Epub 2024 May 13.
2
Report on Cardiovascular Health and Diseases in China 2022: an Updated Summary.《中国心血管健康与疾病报告 2022 概要》发布。
Biomed Environ Sci. 2023 Aug 20;36(8):669-701. doi: 10.3967/bes2023.106.
3
A Systematic Review: Do the Use of Machine Learning, Deep Learning, and Artificial Intelligence Improve Patient Outcomes in Acute Myocardial Ischemia Compared to Clinician-Only Approaches?
剖宫产术后前置胎盘产后出血的可解释机器学习预测模型的开发与验证:一项多中心研究
Reprod Sci. 2025 Aug 12. doi: 10.1007/s43032-025-01937-0.
4
Assessment of risk factors related to early occurrence of deep vein thrombosis after TBI using nomogram model.使用列线图模型评估与创伤性脑损伤后早期发生深静脉血栓形成相关的危险因素。
Sci Rep. 2025 Aug 11;15(1):29313. doi: 10.1038/s41598-025-15287-z.
5
Machine learning and SHAP value interpretation for predicting cardiovascular disease risk in patients with diabetes using dietary antioxidants.利用膳食抗氧化剂预测糖尿病患者心血管疾病风险的机器学习与SHAP值解释
Front Nutr. 2025 Jul 16;12:1612369. doi: 10.3389/fnut.2025.1612369. eCollection 2025.
6
A novel nomogram for survival prediction in renal cell carcinoma patients with brain metastases: an analysis of the SEER database.一种用于预测肾细胞癌脑转移患者生存情况的新型列线图:基于监测、流行病学和最终结果(SEER)数据库的分析
Front Immunol. 2025 Jun 30;16:1572580. doi: 10.3389/fimmu.2025.1572580. eCollection 2025.
7
Predicting carotid atherosclerosis in latent autoimmune diabetes in adult patients using machine learning models: a retrospective study.使用机器学习模型预测成年潜伏性自身免疫性糖尿病患者的颈动脉粥样硬化:一项回顾性研究。
BMC Cardiovasc Disord. 2025 Jul 3;25(1):448. doi: 10.1186/s12872-025-04786-6.
8
Machine learning-based prediction of carotid intima-media thickness progression: a three-year prospective cohort study.基于机器学习的颈动脉内膜中层厚度进展预测:一项为期三年的前瞻性队列研究。
Front Med (Lausanne). 2025 Jun 12;12:1593662. doi: 10.3389/fmed.2025.1593662. eCollection 2025.
9
Integration of Bulk and Single-Cell Transcriptomics Reveals BCL2L14 as a Novel IGKC+ T Cell-Associated Therapeutic Target in Breast Cancer.整合批量和单细胞转录组学揭示BCL2L14是乳腺癌中一种新型的IGKC + T细胞相关治疗靶点。
J Inflamm Res. 2025 Jun 5;18:7215-7234. doi: 10.2147/JIR.S523147. eCollection 2025.
10
Application of machine learning for the analysis of peripheral blood biomarkers in oral mucosal diseases: a cross-sectional study.机器学习在口腔黏膜疾病外周血生物标志物分析中的应用:一项横断面研究。
BMC Oral Health. 2025 May 10;25(1):703. doi: 10.1186/s12903-025-06095-y.
一项系统评价:与仅由临床医生参与的方法相比,机器学习、深度学习和人工智能的应用是否能改善急性心肌缺血患者的治疗结局?
Cureus. 2023 Aug 5;15(8):e43003. doi: 10.7759/cureus.43003. eCollection 2023 Aug.
4
Evolutionary Action-Machine Learning Model Identifies Candidate Genes Associated With Early-Onset Coronary Artery Disease.进化作用-机器学习模型鉴定与早发性冠状动脉疾病相关的候选基因。
J Am Heart Assoc. 2023 Sep 5;12(17):e029103. doi: 10.1161/JAHA.122.029103. Epub 2023 Aug 29.
5
Dietary, macronutrient, micronutrient, and nutrigenetic factors impacting cardiovascular risk markers apolipoprotein B and apolipoprotein A1: a narrative review.饮食、宏量营养素、微量营养素和营养遗传因素对心血管风险标志物载脂蛋白 B 和载脂蛋白 A1 的影响:叙述性综述。
Nutr Rev. 2024 Jun 10;82(7):949-962. doi: 10.1093/nutrit/nuad102.
6
Machine learning algorithms to predict major adverse cardiovascular events in patients with diabetes.机器学习算法预测糖尿病患者的主要不良心血管事件。
Comput Biol Med. 2023 Sep;164:107289. doi: 10.1016/j.compbiomed.2023.107289. Epub 2023 Aug 1.
7
Development of machine learning-based models to predict 10-year risk of cardiovascular disease: a prospective cohort study.基于机器学习的模型预测心血管疾病 10 年风险的开发:一项前瞻性队列研究。
Stroke Vasc Neurol. 2023 Dec 29;8(6):475-485. doi: 10.1136/svn-2023-002332.
8
Heart failure with preserved ejection fraction phenogroup classification using machine learning.采用机器学习对射血分数保留型心力衰竭进行表型分组分类。
ESC Heart Fail. 2023 Jun;10(3):2019-2030. doi: 10.1002/ehf2.14368. Epub 2023 Apr 12.
9
A computational framework of routine test data for the cost-effective chronic disease prediction.用于具有成本效益的慢性病预测的常规测试数据计算框架。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad054.
10
Long-Term Coronary Artery Disease Risk Prediction with Machine Learning Models.基于机器学习模型的长期冠状动脉疾病风险预测。
Sensors (Basel). 2023 Jan 20;23(3):1193. doi: 10.3390/s23031193.