Department of Biostatistics, School of Medicine, Shiraz University of Medical Sciences, Shiraz, Iran.
Emergency Medicine Research Center, Shiraz University of Medical Sciences, Shiraz, Iran.
BMC Public Health. 2022 Jan 5;22(1):10. doi: 10.1186/s12889-021-12383-3.
Narrowing a large set of features to a smaller one can improve our understanding of the main risk factors for in-hospital mortality in patients with COVID-19. This study aimed to derive a parsimonious model for predicting overall survival (OS) among re-infected COVID-19 patients using machine-learning algorithms.
The retrospective data of 283 re-infected COVID-19 patients admitted to twenty-six medical centers (affiliated with Shiraz University of Medical Sciences) from 10 June to 26 December 2020 were reviewed and analyzed. An elastic-net regularized Cox proportional hazards (PH) regression and model approximation via backward elimination were utilized to optimize a predictive model of time to in-hospital death. The model was further reduced to its core features to maximize simplicity and generalizability.
The empirical in-hospital mortality rate among the re-infected COVID-19 patients was 9.5%. In addition, the mortality rate among the intubated patients was 83.5%. Using the Kaplan-Meier approach, the OS (95% CI) rates for days 7, 14, and 21 were 87.5% (81.6-91.6%), 78.3% (65.0-87.0%), and 52.2% (20.3-76.7%), respectively. The elastic-net Cox PH regression retained 8 out of 35 candidate features of death. Transfer by Emergency Medical Services (EMS) (HR=3.90, 95% CI: 1.63-9.48), SpO≤85% (HR=8.10, 95% CI: 2.97-22.00), increased serum creatinine (HR=1.85, 95% CI: 1.48-2.30), and increased white blood cells (WBC) count (HR=1.10, 95% CI: 1.03-1.15) were associated with higher in-hospital mortality rates in the re-infected COVID-19 patients.
The results of the machine-learning analysis demonstrated that transfer by EMS, profound hypoxemia (SpO≤85%), increased serum creatinine (more than 1.6 mg/dL), and increased WBC count (more than 8.5 (×10 cells/L)) reduced the OS of the re-infected COVID-19 patients. We recommend that future machine-learning studies should further investigate these relationships and the associated factors in these patients for a better prediction of OS.
将大量特征缩小到更小的特征集可以帮助我们更好地了解 COVID-19 住院患者死亡的主要风险因素。本研究旨在使用机器学习算法为再感染 COVID-19 患者的总生存率(OS)建立一个简洁的预测模型。
回顾性分析了 2020 年 6 月 10 日至 12 月 26 日来自 26 家医疗中心(隶属于 Shiraz 医科大学)的 283 例再感染 COVID-19 患者的病历资料。利用弹性网络正则化 Cox 比例风险(PH)回归和向后消除法对预测住院死亡时间的模型进行优化。进一步简化模型以最大限度地提高其简洁性和通用性。
再感染 COVID-19 患者的院内死亡率为 9.5%。此外,插管患者的死亡率为 83.5%。Kaplan-Meier 方法显示,第 7、14 和 21 天的 OS(95%CI)率分别为 87.5%(81.6-91.6%)、78.3%(65.0-87.0%)和 52.2%(20.3-76.7%)。弹性网络 Cox PH 回归保留了死亡的 35 个候选特征中的 8 个。通过紧急医疗服务(EMS)转移(HR=3.90,95%CI:1.63-9.48)、SpO≤85%(HR=8.10,95%CI:2.97-22.00)、血清肌酐升高(HR=1.85,95%CI:1.48-2.30)和白细胞计数升高(HR=1.10,95%CI:1.03-1.15)与再感染 COVID-19 患者的住院死亡率升高相关。
机器学习分析结果表明,通过 EMS 转移、严重低氧血症(SpO≤85%)、血清肌酐升高(超过 1.6mg/dL)和白细胞计数升高(超过 8.5(×10 细胞/L))会降低再感染 COVID-19 患者的 OS。我们建议未来的机器学习研究应进一步探讨这些关系以及这些患者的相关因素,以更好地预测 OS。