机器学习模型预测慢性肾脏病患者心血管疾病
Machine learning model for cardiovascular disease prediction in patients with chronic kidney disease.
机构信息
Department of Nephrology, First Medical Center of Chinese PLA General Hospital, National Key Laboratory of Kidney Diseases, National Clinical Research Center for Kidney Diseases, Beijing Key Laboratory of Kidney Diseases Research, Beijing, China.
School of Clinical Medicine, Guangdong Pharmaceutical University, Guangzhou, China.
出版信息
Front Endocrinol (Lausanne). 2024 May 28;15:1390729. doi: 10.3389/fendo.2024.1390729. eCollection 2024.
INTRODUCTION
Cardiovascular disease (CVD) is the leading cause of death in patients with chronic kidney disease (CKD). This study aimed to develop CVD risk prediction models using machine learning to support clinical decision making and improve patient prognosis.
METHODS
Electronic medical records from patients with CKD at a single center from 2015 to 2020 were used to develop machine learning models for the prediction of CVD. Least absolute shrinkage and selection operator (LASSO) regression was used to select important features predicting the risk of developing CVD. Seven machine learning classification algorithms were used to build models, which were evaluated by receiver operating characteristic curves, accuracy, sensitivity, specificity, and F1-score, and Shapley Additive explanations was used to interpret the model results. CVD was defined as composite cardiovascular events including coronary heart disease (coronary artery disease, myocardial infarction, angina pectoris, and coronary artery revascularization), cerebrovascular disease (hemorrhagic stroke and ischemic stroke), deaths from all causes (cardiovascular deaths, non-cardiovascular deaths, unknown cause of death), congestive heart failure, and peripheral artery disease (aortic aneurysm, aortic or other peripheral arterial revascularization). A cardiovascular event was a composite outcome of multiple cardiovascular events, as determined by reviewing medical records.
RESULTS
This study included 8,894 patients with CKD, with a composite CVD event incidence of 25.9%; a total of 2,304 patients reached this outcome. LASSO regression identified eight important features for predicting the risk of CKD developing into CVD: age, history of hypertension, sex, antiplatelet drugs, high-density lipoprotein, sodium ions, 24-h urinary protein, and estimated glomerular filtration rate. The model developed using Extreme Gradient Boosting in the test set had an area under the curve of 0.89, outperforming the other models, indicating that it had the best CVD predictive performance.
CONCLUSION
This study established a CVD risk prediction model for patients with CKD, based on routine clinical diagnostic and treatment data, with good predictive accuracy. This model is expected to provide a scientific basis for the management and treatment of patients with CKD.
简介
心血管疾病(CVD)是慢性肾脏病(CKD)患者死亡的主要原因。本研究旨在使用机器学习开发 CVD 风险预测模型,以支持临床决策并改善患者预后。
方法
使用来自 2015 年至 2020 年单中心 CKD 患者的电子病历开发 CVD 预测的机器学习模型。使用最小绝对收缩和选择算子(LASSO)回归选择预测 CVD 风险的重要特征。使用七种机器学习分类算法构建模型,通过接收者操作特征曲线、准确性、敏感性、特异性和 F1 评分评估模型,并使用 Shapley 加性解释解释模型结果。CVD 定义为包括冠心病(冠状动脉疾病、心肌梗死、心绞痛和冠状动脉血运重建)、脑血管疾病(出血性中风和缺血性中风)、全因死亡(心血管死亡、非心血管死亡、死因不明)、充血性心力衰竭和外周动脉疾病(主动脉瘤、主动脉或其他外周动脉血运重建)在内的复合心血管事件。心血管事件是通过审查病历确定的多种心血管事件的复合结局。
结果
本研究纳入了 8894 例 CKD 患者,复合 CVD 事件发生率为 25.9%;共有 2304 例患者达到这一结果。LASSO 回归确定了预测 CKD 发展为 CVD 风险的八个重要特征:年龄、高血压病史、性别、抗血小板药物、高密度脂蛋白、钠离子、24 小时尿蛋白和估计肾小球滤过率。在测试集中使用极端梯度增强开发的模型曲线下面积为 0.89,优于其他模型,表明其具有最佳的 CVD 预测性能。
结论
本研究基于常规临床诊断和治疗数据为 CKD 患者建立了 CVD 风险预测模型,具有良好的预测准确性。该模型有望为 CKD 患者的管理和治疗提供科学依据。