Nieto-Gutierrez Wendy, Campos-Chambergo Jaid, Gonzalez-Ayala Enrique, Oyola-Garcia Oswaldo, Alejandro-Mora Alberti, Luis-Aguirre Eliana, Pasquel-Santillan Roly, Leiva-Aguirre Juan, Ugarte-Gil Cesar, Loyola Steev
Facultad de Salud Pública, Universidad Peruana Cayetano Heredia, Lima, Perú.
Universidad Científica del Sur, Lima, Perú.
PLOS Glob Public Health. 2024 Jan 29;4(1):e0002854. doi: 10.1371/journal.pgph.0002854. eCollection 2024.
There are initiatives to promote the creation of predictive COVID-19 fatality models to assist decision-makers. The study aimed to develop prediction models for COVID-19 fatality using population data recorded in the national epidemiological surveillance system of Peru. A retrospective cohort study was conducted (March to September of 2020). The study population consisted of confirmed COVID-19 cases reported in the surveillance system of nine provinces of Lima, Peru. A random sample of 80% of the study population was selected, and four prediction models were constructed using four different strategies to select variables: 1) previously analyzed variables in machine learning models; 2) based on the LASSO method; 3) based on significance; and 4) based on a post-hoc approach with variables consistently included in the three previous strategies. The internal validation was performed with the remaining 20% of the population. Four prediction models were successfully created and validate using data from 22,098 cases. All models performed adequately and similarly; however, we selected models derived from strategy 1 (AUC 0.89, CI95% 0.87-0.91) and strategy 4 (AUC 0.88, CI95% 0.86-0.90). The performance of both models was robust in validation and sensitivity analyses. This study offers insights into estimating COVID-19 fatality within the Peruvian population. Our findings contribute to the advancement of prediction models for COVID-19 fatality and may aid in identifying individuals at increased risk, enabling targeted interventions to mitigate the disease. Future studies should confirm the performance and validate the usefulness of the models described here under real-world conditions and settings.
目前有多项举措致力于推动创建预测新冠病毒死亡情况的模型,以协助决策者。本研究旨在利用秘鲁国家流行病学监测系统中记录的人口数据,开发新冠病毒死亡情况的预测模型。开展了一项回顾性队列研究(2020年3月至9月)。研究人群包括秘鲁利马九个省份监测系统中报告的新冠病毒确诊病例。选取了研究人群80%的随机样本,并采用四种不同的变量选择策略构建了四个预测模型:1)机器学习模型中先前分析过的变量;2)基于套索(LASSO)方法;3)基于显著性;4)基于事后分析方法,将前三种策略中始终包含的变量纳入。使用其余20%的人群进行内部验证。利用22,098例病例的数据成功创建并验证了四个预测模型。所有模型的表现都足够且相似;然而,我们选择了源自策略1(曲线下面积[AUC]为0.89,95%置信区间[CI]为0.87 - 0.91)和策略4(AUC为0.88,CI95%为0.86 - 0.90)的模型。这两个模型在验证和敏感性分析中的表现都很稳健。本研究为估计秘鲁人群中的新冠病毒死亡情况提供了见解。我们的研究结果有助于推进新冠病毒死亡情况预测模型的发展,并可能有助于识别风险增加的个体,从而实现有针对性的干预措施以减轻疾病影响。未来的研究应在现实世界的条件和环境下确认此处所述模型的性能并验证其有用性。