Harry Butler Institute, Murdoch University, Murdoch, Australia.
Department of Information Technology, Murdoch University, Murdoch, Australia.
Respir Res. 2023 Mar 13;24(1):79. doi: 10.1186/s12931-023-02386-6.
We applied machine learning (ML) algorithms to generate a risk prediction tool [Collaboration for Risk Evaluation in COVID-19 (CORE-COVID-19)] for predicting the composite of 30-day endotracheal intubation, intravenous administration of vasopressors, or death after COVID-19 hospitalization and compared it with the existing risk scores.
This is a retrospective study of adults hospitalized with COVID-19 from March 2020 to February 2021. Patients, each with 92 variables, and one composite outcome underwent feature selection process to identify the most predictive variables. Selected variables were modeled to build four ML algorithms (artificial neural network, support vector machine, gradient boosting machine, and Logistic regression) and an ensemble model to generate a CORE-COVID-19 model to predict the composite outcome and compared with existing risk prediction scores. The net benefit for clinical use of each model was assessed by decision curve analysis.
Of 1796 patients, 278 (15%) patients reached primary outcome. Six most predictive features were identified. Four ML algorithms achieved comparable discrimination (P > 0.827) with c-statistics ranged 0.849-0.856, calibration slopes 0.911-1.173, and Hosmer-Lemeshow P > 0.141 in validation dataset. These 6-variable fitted CORE-COVID-19 model revealed a c-statistic of 0.880, which was significantly (P < 0.04) higher than ISARIC-4C (0.751), CURB-65 (0.735), qSOFA (0.676), and MEWS (0.674) for outcome prediction. The net benefit of the CORE-COVID-19 model was greater than that of the existing risk scores.
The CORE-COVID-19 model accurately assigned 88% of patients who potentially progressed to 30-day composite events and revealed improved performance over existing risk scores, indicating its potential utility in clinical practice.
我们应用机器学习(ML)算法生成了一种风险预测工具[协作 COVID-19 风险评估(CORE-COVID-19)],用于预测 COVID-19 住院后 30 天内气管插管、静脉注射血管加压素或死亡的综合结果,并将其与现有风险评分进行比较。
这是一项回顾性研究,纳入了 2020 年 3 月至 2021 年 2 月期间因 COVID-19 住院的成年人。每位患者有 92 个变量和一个综合结局,经过特征选择过程以确定最具预测性的变量。选择的变量用于构建四种 ML 算法(人工神经网络、支持向量机、梯度提升机和逻辑回归)和一个集成模型,以生成 CORE-COVID-19 模型来预测综合结局,并与现有风险预测评分进行比较。通过决策曲线分析评估每个模型在临床应用中的净收益。
在 1796 名患者中,278 名(15%)患者达到了主要结局。确定了六个最具预测性的特征。四种 ML 算法在验证数据集中的判别能力相当(P > 0.827),C 统计量范围为 0.849-0.856,校准斜率为 0.911-1.173,Hosmer-Lemeshow P > 0.141。这个由 6 个变量拟合的 CORE-COVID-19 模型显示出的 C 统计量为 0.880,与 ISARIC-4C(0.751)、CURB-65(0.735)、qSOFA(0.676)和 MEWS(0.674)相比,对结局的预测有显著提高(P < 0.04)。CORE-COVID-19 模型的净收益大于现有风险评分。
CORE-COVID-19 模型准确地为 88%可能进展为 30 天综合事件的患者分配了风险,并显示出优于现有风险评分的性能,表明其在临床实践中的潜在应用价值。