Department of Biomedical Informatics, Columbia University, New York, New York, USA.
Department of Translational Data Science and Informatics, Geisinger, Danville, Pennsylvania, USA.
J Am Med Inform Assoc. 2021 Jul 14;28(7):1480-1488. doi: 10.1093/jamia/ocab029.
Coronavirus disease 2019 (COVID-19) patients are at risk for resource-intensive outcomes including mechanical ventilation (MV), renal replacement therapy (RRT), and readmission. Accurate outcome prognostication could facilitate hospital resource allocation. We develop and validate predictive models for each outcome using retrospective electronic health record data for COVID-19 patients treated between March 2 and May 6, 2020.
For each outcome, we trained 3 classes of prediction models using clinical data for a cohort of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2)-positive patients (n = 2256). Cross-validation was used to select the best-performing models per the areas under the receiver-operating characteristic and precision-recall curves. Models were validated using a held-out cohort (n = 855). We measured each model's calibration and evaluated feature importances to interpret model output.
The predictive performance for our selected models on the held-out cohort was as follows: area under the receiver-operating characteristic curve-MV 0.743 (95% CI, 0.682-0.812), RRT 0.847 (95% CI, 0.772-0.936), readmission 0.871 (95% CI, 0.830-0.917); area under the precision-recall curve-MV 0.137 (95% CI, 0.047-0.175), RRT 0.325 (95% CI, 0.117-0.497), readmission 0.504 (95% CI, 0.388-0.604). Predictions were well calibrated, and the most important features within each model were consistent with clinical intuition.
Our models produce performant, well-calibrated, and interpretable predictions for COVID-19 patients at risk for the target outcomes. They demonstrate the potential to accurately estimate outcome prognosis in resource-constrained care sites managing COVID-19 patients.
We develop and validate prognostic models targeting MV, RRT, and readmission for hospitalized COVID-19 patients which produce accurate, interpretable predictions. Additional external validation studies are needed to further verify the generalizability of our results.
2019 年冠状病毒病(COVID-19)患者有发生包括机械通气(MV)、肾脏替代治疗(RRT)和再入院在内的资源密集型结局的风险。准确的预后预测可以帮助医院分配资源。我们使用 2020 年 3 月 2 日至 5 月 6 日期间治疗的 COVID-19 患者的回顾性电子健康记录数据,为每种结局开发和验证预测模型。
对于每种结局,我们使用 SARS-CoV-2(严重急性呼吸综合征冠状病毒 2)阳性患者队列的临床数据训练了 3 类预测模型(n=2256)。使用接受者操作特征曲线和精度-召回曲线下面积来选择每个模型的最佳表现。使用保留队列(n=855)验证模型。我们测量了每个模型的校准程度,并评估了特征重要性来解释模型输出。
在保留队列中,我们选择的模型的预测性能如下:MV 的接受者操作特征曲线下面积为 0.743(95%置信区间,0.682-0.812),RRT 为 0.847(95%置信区间,0.772-0.936),再入院为 0.871(95%置信区间,0.830-0.917);MV 的精度-召回曲线下面积为 0.137(95%置信区间,0.047-0.175),RRT 为 0.325(95%置信区间,0.117-0.497),再入院为 0.504(95%置信区间,0.388-0.604)。预测结果校准良好,每个模型中的最重要特征与临床直觉一致。
我们的模型为有目标结局风险的 COVID-19 患者生成了性能良好、校准良好且可解释的预测。它们有可能在资源有限的 COVID-19 患者管理场所中准确估计预后。
我们为住院 COVID-19 患者开发并验证了针对 MV、RRT 和再入院的预测模型,这些模型提供了准确且可解释的预测。需要进一步的外部验证研究来进一步验证我们结果的普遍性。