UCLA David Geffen School of Medicine, Los Angeles, CA, USA.
Department of Computer Science, UCLA, Los Angeles, CA, USA.
ESC Heart Fail. 2024 Oct;11(5):2490-2498. doi: 10.1002/ehf2.14796. Epub 2024 Apr 18.
Existing risk prediction models for hospitalized heart failure patients are limited. We identified patients hospitalized with a diagnosis of heart failure between 7 May 2013 and 26 April 2022 from a large academic, quaternary care medical centre (training cohort). Demographics, medical comorbidities, vitals, and labs were collected and were used to construct random forest machine learning models to predict in-hospital mortality. Models were compared with logistic regression, and to commonly used heart failure risk scores. The models were subsequently validated in patients hospitalized with a diagnosis of heart failure from a second academic, community medical centre (validation cohort). The entire cohort comprised 21 802 patients, of which 14 539 were in the training cohort and 7263 were in the validation cohort. The median age (25th-75th percentile) was 70 (58-82) for the entire cohort, 43.2% were female, and 6.7% experienced inpatient mortality. In the overall cohort, 7621 (35.0%) patients had heart failure with reduced ejection fraction (EF ≤ 40%), 1271 (5.8%) had heart failure with mildly reduced EF (EF 41-49%), and 12 910 (59.2%) had heart failure with preserved EF (EF ≥ 50%). Random forest models in the validation cohort demonstrated a c-statistic (95% confidence interval) of 0.96 (0.95-0.97), sensitivity (SN) of 87.3%, and specificity (SP) of 90.6% for the prediction of in-hospital mortality. Models for those with HFrEF demonstrated a c-statistic of 0.96 (0.94-0.98), SN 88.2%, and SP 91.0%, and those for patients with HFpEF showed a c-statistic of 0.95 (0.93-0.97), SN 87.4%, and SP 89.5% for predicting in-hospital mortality. The random forest model significantly outperformed logistic regression (c-statistic 0.87, SN 75.9%, and SP 86.9%), and current existing risk scores including the Acute Decompensated Heart Failure National Registry risk score (c-statistic of 0.70, SN 69%, and SP 62%), and the Get With the Guidelines-Heart Failure risk score (c-statistic 0.69, SN 67%, and SP 63%); P < 0.001 for comparison. Machine learning models built from commonly recorded patient information can accurately predict in-hospital mortality among patients hospitalized with a diagnosis of heart failure.
现有的住院心力衰竭患者风险预测模型存在局限性。我们从一家大型学术、四级保健医疗中心(训练队列)中确定了 2013 年 5 月 7 日至 2022 年 4 月 26 日期间因心力衰竭住院的患者。收集了人口统计学、合并症、生命体征和实验室数据,并用于构建随机森林机器学习模型,以预测住院死亡率。将模型与逻辑回归和常用心力衰竭风险评分进行比较。随后,在第二家学术、社区医疗中心(验证队列)住院的心力衰竭患者中对模型进行验证。整个队列包括 21802 名患者,其中 14539 名在训练队列中,7263 名在验证队列中。整个队列的中位年龄(25 至 75 百分位数)为 70(58 至 82),43.2%为女性,6.7%发生住院死亡。在整个队列中,7621 名(35.0%)患者有心衰射血分数降低(EF≤40%),1271 名(5.8%)患者有心衰射血分数轻度降低(EF41-49%),12910 名(59.2%)患者有心衰射血分数保留(EF≥50%)。验证队列中的随机森林模型显示,住院死亡率预测的 C 统计量(95%置信区间)为 0.96(0.95-0.97),敏感性(SN)为 87.3%,特异性(SP)为 90.6%。HFrEF 患者模型的 C 统计量为 0.96(0.94-0.98),SN 为 88.2%,SP 为 91.0%,HFpEF 患者模型的 C 统计量为 0.95(0.93-0.97),SN 为 87.4%,SP 为 89.5%。随机森林模型明显优于逻辑回归(C 统计量 0.87,SN 75.9%,SP 86.9%),以及目前现有的风险评分,包括急性失代偿性心力衰竭国家登记风险评分(C 统计量 0.70,SN 69%,SP 62%)和 Get With the Guidelines-心力衰竭风险评分(C 统计量 0.69,SN 67%,SP 63%);P<0.001 用于比较。基于患者常见记录信息构建的机器学习模型可以准确预测因心力衰竭住院患者的住院死亡率。