Electrical and Software Engineering, University of Calgary, Calgary, AB, Canada.
Community Health Sciences, University of Calgary, Calgary, AB, Canada.
Sci Rep. 2023 Oct 18;13(1):17708. doi: 10.1038/s41598-023-43943-9.
In this study, we aimed to identify the factors that were associated with mortality among continuing care residents in Alberta, during the coronavirus disease 2019 (COVID-19) pandemic. We achieved this by leveraging and linking various administrative datasets together. Then, we examined pre-processing methods in terms of prediction performance. Finally, we developed several machine learning models and compared the results of these models in terms of performance. We conducted a retrospective cohort study of all continuing care residents in Alberta, Canada, from March 1, 2020, to March 31, 2021. We used a univariable and a multivariable logistic regression (LR) model to identify predictive factors of 60-day all-cause mortality by estimating odds ratios (ORs) with a 95% confidence interval. To determine the best sensitivity-specificity cut-off point, the Youden index was employed. We developed several machine learning models to determine the best model regarding performance. In this cohort study, increased age, male sex, symptoms, previous admissions, and some specific comorbidities were associated with increased mortality. Machine learning and pre-processing approaches offer a potentially valuable method for improving risk prediction for mortality, but more work is needed to show improvement beyond standard risk factors.
在这项研究中,我们旨在确定与 2019 年冠状病毒病(COVID-19)大流行期间艾伯塔省长期护理居民死亡率相关的因素。我们通过利用和链接各种行政数据集来实现这一目标。然后,我们根据预测性能检查了预处理方法。最后,我们开发了几个机器学习模型,并比较了这些模型在性能方面的结果。我们对 2020 年 3 月 1 日至 2021 年 3 月 31 日期间加拿大艾伯塔省所有长期护理居民进行了回顾性队列研究。我们使用单变量和多变量逻辑回归(LR)模型,通过估计优势比(OR)和 95%置信区间来确定 60 天全因死亡率的预测因素。为了确定最佳的敏感性-特异性截止点,采用了约登指数。我们开发了几个机器学习模型,以确定在性能方面最佳的模型。在这项队列研究中,年龄增长、男性、症状、先前入院和某些特定合并症与死亡率增加有关。机器学习和预处理方法为提高死亡率风险预测提供了一种潜在有价值的方法,但需要做更多的工作来证明其在标准风险因素之外的改进。