Department of Cardiovascular medicine, Affiliated Fuzhou First Hospital of Fujian Medical University, Fuzhou, Fujian, China.
The Third Clinical Medical College, Fujian Medical University, Fuzhou, Fujian, China.
Clin Cardiol. 2023 Oct;46(10):1234-1243. doi: 10.1002/clc.24104. Epub 2023 Jul 31.
The purpose of this study was to develop and validate a machine learning (ML) based prediction model for the risk of heart failure (HF) in patients with prediabetes or diabetes.
We used 3527 subjects aged 40 years and older with a prior diagnosis of prediabetes or diabetes from the National Health and Nutrition Examination Survey (NHANES) from 2007 to 2018. The search for independent risk variables linked to HF was conducted using univariate and multivariable logistic regression analysis. The 3527 subjects were randomly divided into training set and validation set in a 7:3 ratio. Five ML models were built on the training set using five ML algorithms, including random forest (RF), and then validated on the validation set. Receiver operating characteristic (ROC) curves, calibration curves, and decision curve analysis and Bootstrap resampling method were used to measure the predictive performance of the five ML models.
Multivariate logistic regression analysis showed that age, poverty-to-income ratio, myocardial infarction condition, coronary heart disease condition, chest pain condition, and glucose-lowering medication use were independent predictors of HF. By comparing the performance of the five ML models, the RF model (AUC = 0.978) was the best prediction model.
The risk of HF in middle-aged and elderly patients with prediabetes or diabetes can be accurately predicted using ML models. The best prediction performance is presented by RF model, which can assist doctors in making clinical decisions.
本研究旨在开发和验证一种基于机器学习(ML)的预测模型,用于预测患有糖尿病前期或糖尿病的患者发生心力衰竭(HF)的风险。
我们使用了来自 2007 年至 2018 年国家健康和营养检查调查(NHANES)的 3527 名年龄在 40 岁及以上、有糖尿病前期或糖尿病既往诊断的患者。使用单变量和多变量逻辑回归分析寻找与 HF 相关的独立风险变量。将 3527 名患者以 7:3 的比例随机分为训练集和验证集。在训练集上使用五种 ML 算法(包括随机森林(RF))构建了五个 ML 模型,然后在验证集上进行验证。使用受试者工作特征(ROC)曲线、校准曲线、决策曲线分析和 Bootstrap 重采样方法来衡量五种 ML 模型的预测性能。
多变量逻辑回归分析表明,年龄、贫困收入比、心肌梗死情况、冠心病情况、胸痛情况和降血糖药物使用是 HF 的独立预测因素。通过比较五种 ML 模型的性能,RF 模型(AUC=0.978)是最佳预测模型。
ML 模型可准确预测中老年糖尿病前期或糖尿病患者发生 HF 的风险。RF 模型表现出最佳的预测性能,可帮助医生做出临床决策。