• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的2型糖尿病合并冠心病预测模型及基于可解释SHAP的特征分析

Machine learning-based models to predict type 2 diabetes combined with coronary heart disease and feature analysis-based on interpretable SHAP.

作者信息

Ji Yijian, Shang Hongyan, Yi Jing, Zang Wenhui, Cao Wenjun

机构信息

Academy of Public Health, Shanxi Medical University, Jinzhong, Shanxi, China.

Academy of Medical Sciences, Shanxi Medical University, Jinzhong, Shanxi, China.

出版信息

Acta Diabetol. 2025 Apr 1. doi: 10.1007/s00592-025-02496-1.

DOI:10.1007/s00592-025-02496-1
PMID:40167635
Abstract

BACKGROUND

Type 2 diabetes and coronary heart disease exhibit heightened prevalence in the Chinese population, posing as leading causes of mortality. The combination of diabetes and coronary heart disease, due to its challenging diagnosis and poor prognosis, imposes a significant disease burden. In recent years, machine learning has frequently been employed in diagnostic applications within medical fields; however, predictive models for type 2 diabetes complicated by coronary heart disease have been confronted with issues such as lower predictive performance and interference from other comorbidities during prediction.

METHODS

This study enhances the predictive accuracy, sensitivity, specificity, F1 score, and AUC of models forecasting the coexistence of diabetes and coronary heart disease. We developed an advanced prediction model using XGBoost combined with SHAP for feature analysis. Through comparative feature selection, hyperparameter optimization, and computational efficiency analysis, we identified optimal conditions for model performance. External validation with independent datasets confirmed the model's robustness and generalizability, supporting its potential implementation in clinical practice.

RESULTS

This study compared three models-Random Forest, LightGBM, and XGBoost-and found that XGBoost exhibited superior performance in both efficacy and computational efficiency. The accuracy (Acc) of the XGBoost model was 0.8910, which improved to 0.8942 after hyperparameter tuning. External validation using datasets from Pingyang Hospital and Heji Hospital in Shanxi Province, China, yielded an AUC of 0.7897, demonstrating robust generalizability. By integrating SHAP (SHapley Additive exPlanations) for interpretability, our study identified bilirubin levels, basophil count, cholesterol levels, and age as key features for predicting the coexistence of type 2 diabetes mellitus (T2DM) and coronary heart disease (CHD). These findings are seamlessly consistent with the feature importance rankings determined by the XGBoost algorithm. The model demonstrates moderate predictive performance (AUC = 0.7879 in external validation) with practical interpretability, offering potential utility in improving diagnostic efficiency for T2DM-CHD comorbidity in resource-limited settings. However, its clinical implementation requires further validation in diverse populations.

摘要

背景

2型糖尿病和冠心病在中国人群中的患病率不断上升,是主要的死亡原因。糖尿病和冠心病并存,由于其诊断具有挑战性且预后较差,带来了重大的疾病负担。近年来,机器学习在医学领域的诊断应用中频繁使用;然而,预测2型糖尿病合并冠心病的模型面临着预测性能较低以及预测过程中受到其他合并症干扰等问题。

方法

本研究提高了预测糖尿病和冠心病并存的模型的预测准确性、敏感性、特异性、F1分数和AUC。我们使用XGBoost结合SHAP进行特征分析,开发了一种先进的预测模型。通过比较特征选择、超参数优化和计算效率分析,我们确定了模型性能的最佳条件。使用独立数据集进行外部验证,证实了该模型的稳健性和通用性,支持其在临床实践中的潜在应用。

结果

本研究比较了三种模型——随机森林、LightGBM和XGBoost——发现XGBoost在有效性和计算效率方面均表现出卓越性能。XGBoost模型的准确率(Acc)为0.8910,经过超参数调整后提高到0.8942。使用中国山西省平阳医院和河津医院的数据集进行外部验证,得出AUC为0.7897,证明了强大的通用性。通过整合SHAP(SHapley Additive exPlanations)以实现可解释性,我们的研究确定胆红素水平、嗜碱性粒细胞计数、胆固醇水平和年龄是预测2型糖尿病(T2DM)和冠心病(CHD)并存的关键特征。这些发现与XGBoost算法确定的特征重要性排名完全一致。该模型具有适度的预测性能(外部验证中AUC = 0.7879)和实际可解释性,在资源有限的环境中提高T2DM-CHD合并症的诊断效率方面具有潜在用途。然而,其临床应用需要在不同人群中进一步验证。

相似文献

1
Machine learning-based models to predict type 2 diabetes combined with coronary heart disease and feature analysis-based on interpretable SHAP.基于机器学习的2型糖尿病合并冠心病预测模型及基于可解释SHAP的特征分析
Acta Diabetol. 2025 Apr 1. doi: 10.1007/s00592-025-02496-1.
2
Prediction of sepsis mortality in ICU patients using machine learning methods.使用机器学习方法预测 ICU 患者的败血症死亡率。
BMC Med Inform Decis Mak. 2024 Aug 16;24(1):228. doi: 10.1186/s12911-024-02630-z.
3
Development of Enhanced Machine Learning Models for Predicting Type 2 Diabetes Mellitus Using Heart Rate Variability: A Retrospective Study.利用心率变异性预测2型糖尿病的增强机器学习模型的开发:一项回顾性研究
Cureus. 2025 Mar 21;17(3):e80933. doi: 10.7759/cureus.80933. eCollection 2025 Mar.
4
Development of interpretable machine learning models to predict in-hospital prognosis of acute heart failure patients.开发可解释的机器学习模型以预测急性心力衰竭患者的院内预后。
ESC Heart Fail. 2024 Oct;11(5):2798-2812. doi: 10.1002/ehf2.14834. Epub 2024 May 15.
5
Development and validation of a machine learning-based predictive model for assessing the 90-day prognostic outcome of patients with spontaneous intracerebral hemorrhage.基于机器学习的预测模型评估自发性脑出血患者 90 天预后结局的开发与验证。
J Transl Med. 2024 Mar 4;22(1):236. doi: 10.1186/s12967-024-04896-3.
6
Non-invasive Prediction of Lymph Node Metastasis in NSCLC Using Clinical, Radiomics, and Deep Learning Features From F-FDG PET/CT Based on Interpretable Machine Learning.基于可解释机器学习,利用F-FDG PET/CT的临床、影像组学和深度学习特征对非小细胞肺癌淋巴结转移进行无创预测
Acad Radiol. 2025 Mar;32(3):1645-1655. doi: 10.1016/j.acra.2024.11.037. Epub 2024 Dec 10.
7
Integrating SHAP analysis with machine learning to predict postpartum hemorrhage in vaginal births.将SHAP分析与机器学习相结合以预测阴道分娩中的产后出血。
BMC Pregnancy Childbirth. 2025 May 3;25(1):529. doi: 10.1186/s12884-025-07633-w.
8
Interpretable machine learning for 28-day all-cause in-hospital mortality prediction in critically ill patients with heart failure combined with hypertension: A retrospective cohort study based on medical information mart for intensive care database-IV and eICU databases.用于预测心力衰竭合并高血压重症患者28天全因院内死亡率的可解释机器学习:一项基于重症监护医学信息集市数据库-IV和电子重症监护病房数据库的回顾性队列研究
Front Cardiovasc Med. 2022 Oct 12;9:994359. doi: 10.3389/fcvm.2022.994359. eCollection 2022.
9
Detecting severe coronary artery stenosis in T2DM patients with NAFLD using cardiac fat radiomics-based machine learning.利用基于心脏脂肪影像组学的机器学习检测非酒精性脂肪性肝病2型糖尿病患者的严重冠状动脉狭窄
Sci Rep. 2025 Feb 25;15(1):6788. doi: 10.1038/s41598-025-91523-w.
10
Establishment and validation of a heart failure risk prediction model for elderly patients after coronary rotational atherectomy based on machine learning.基于机器学习的老年患者冠状动脉旋磨术后心力衰竭风险预测模型的建立与验证
PeerJ. 2024 Jan 31;12:e16867. doi: 10.7717/peerj.16867. eCollection 2024.

本文引用的文献

1
A prospective study of waist circumference trajectories and incident cardiovascular disease in China: the Kailuan Cohort Study.中国腰围轨迹与心血管疾病发病的前瞻性研究:开滦队列研究
Am J Clin Nutr. 2021 Feb;113(2):338-347. doi: 10.1093/ajcn/nqaa331.
2
Cardiovascular complications in a diabetes prediction model using machine learning: a systematic review.基于机器学习的糖尿病预测模型中的心血管并发症:系统评价。
Cardiovasc Diabetol. 2023 Jan 19;22(1):13. doi: 10.1186/s12933-023-01741-7.
3
Associations of genetically predicted IL-6 signaling with cardiovascular disease risk across population subgroups.
遗传预测的 IL-6 信号与人群亚组心血管疾病风险的关联。
BMC Med. 2022 Aug 11;20(1):245. doi: 10.1186/s12916-022-02446-6.
4
Acute coronary syndromes in diabetic patients, outcome, revascularization, and antithrombotic therapy.糖尿病患者的急性冠脉综合征、结局、血运重建和抗栓治疗。
Biomed Pharmacother. 2022 Apr;148:112772. doi: 10.1016/j.biopha.2022.112772. Epub 2022 Mar 1.
5
Associations of Glycemic Index and Glycemic Load with Cardiovascular Disease: Updated Evidence from Meta-analysis and Cohort Studies.血糖指数和血糖负荷与心血管疾病的关系:来自荟萃分析和队列研究的更新证据。
Curr Cardiol Rep. 2022 Mar;24(3):141-161. doi: 10.1007/s11886-022-01635-2. Epub 2022 Feb 4.
6
Heart disease prediction using supervised machine learning algorithms: Performance analysis and comparison.基于监督机器学习算法的心脏病预测:性能分析与比较。
Comput Biol Med. 2021 Sep;136:104672. doi: 10.1016/j.compbiomed.2021.104672. Epub 2021 Jul 21.
7
Examining the heterogeneity inexcess risks of coronary heart disease, stroke, dialysis, and lower extremity amputation associated with type 2 diabetes mellitus across demographic subgroups in an Asian population: A population-based matched cohort study.在亚洲人群中,基于人群的匹配队列研究考察了 2 型糖尿病在不同人口亚组中与冠心病、中风、透析和下肢截肢相关的超额风险的异质性。
Diabetes Res Clin Pract. 2021 Jan;171:108551. doi: 10.1016/j.diabres.2020.108551. Epub 2020 Nov 22.
8
Obesity and cardiovascular disease risk among Africans residing in Europe and Africa: the RODAM study.非洲裔欧洲居民和非洲居民的肥胖与心血管疾病风险:RODAM 研究。
Obes Res Clin Pract. 2020 Mar-Apr;14(2):151-157. doi: 10.1016/j.orcp.2020.01.007. Epub 2020 Feb 12.
9
Risk and management of pre-diabetes.糖尿病前期的风险与管理。
Eur J Prev Cardiol. 2019 Dec;26(2_suppl):47-54. doi: 10.1177/2047487319880041.
10
Serum albumin and risk of cardiovascular events in primary and secondary prevention: a systematic review of observational studies and Bayesian meta-regression analysis.血清白蛋白与一级和二级预防中心血管事件风险:观察性研究的系统评价和贝叶斯元回归分析
Intern Emerg Med. 2020 Jan;15(1):135-143. doi: 10.1007/s11739-019-02204-2. Epub 2019 Oct 11.