From the Department of Emergency Medicine, University of Texas Southwestern Medical Center, Dallas, TX, USA.
the Clinical Informatics Center, University of Texas Southwestern Medical Center, Dallas, TX, USA.
Acad Emerg Med. 2021 Feb;28(2):206-214. doi: 10.1111/acem.14182. Epub 2020 Dec 22.
The COVID-19 pandemic has placed acute care providers in demanding situations in predicting disease given the clinical variability, desire to cohort patients, and high variance in testing availability. An approach to stratifying patients by likelihood of disease based on rapidly available emergency department (ED) clinical data would offer significant operational and clinical value. The purpose of this study was to develop and internally validate a predictive model to aid in the discrimination of patients undergoing investigation for COVID-19.
All patients greater than 18 years presenting to a single academic ED who were tested for COVID-19 during this index ED evaluation were included. Outcome was defined as the result of COVID-19 polymerase chain reaction (PCR) testing during the index visit or any positive result within the following 7 days. Variables included chest radiograph interpretation, disease-specific screening questions, and laboratory data. Three models were developed with a split-sample approach to predict outcome of the PCR test utilizing logistic regression, random forest, and gradient-boosted decision tree methods. Model discrimination was evaluated comparing area under the receiver operator curve (AUC) and point statistics at a predefined threshold.
A total of 1,026 patients were included in the study collected between March and April 2020. Overall, there was disease prevalence of 9.6% in the population under study during this time frame. The logistic regression model was found to have an AUC of 0.89 (95% confidence interval [CI] = 0.84 to 0.94) when including four features: exposure history, temperature, white blood cell count (WBC), and chest radiograph result. Random forest method resulted in AUC of 0.86 (95% CI = 0.79 to 0.92) and gradient boosting had an AUC of 0.85 (95% CI = 0.79 to 0.91). With a consistently held negative predictive value, the logistic regression model had a positive predictive value of 0.29 (0.2-0.39) compared to 0.2 (0.14-0.28) for random forest and 0.22 (0.15-0.3) for the gradient-boosted method.
The derived predictive models offer good discriminating capacity for COVID-19 disease and provide interpretable and usable methods for those providers caring for these patients at the important crossroads of the community and the health system. We found utilization of the logistic regression model utilizing exposure history, temperature, WBC, and chest X-ray result had the greatest discriminatory capacity with the most interpretable model. Integrating a predictive model-based approach to COVID-19 testing decisions and patient care pathways and locations could add efficiency and accuracy to decrease uncertainty.
在预测疾病方面,COVID-19 大流行使急症护理提供者面临严峻挑战,因为疾病存在临床变异性、患者分组的意愿以及检测可及性差异很大。基于在急诊科(ED)迅速获得的临床数据对疾病可能性进行分层的方法将具有显著的操作和临床价值。本研究的目的是开发并内部验证一个预测模型,以帮助区分接受 COVID-19 检查的患者。
纳入在这段时间内于单一学术 ED 就诊且在该 ED 评估期间接受 COVID-19 检测的年龄大于 18 岁的所有患者。结局定义为在本次就诊期间 COVID-19 聚合酶链反应(PCR)检测的结果,或在随后的 7 天内出现任何阳性结果。变量包括胸部 X 线片解读、疾病特异性筛查问题和实验室数据。采用拆分样本的方法,使用逻辑回归、随机森林和梯度提升决策树方法开发了三个模型,以预测 PCR 检测的结果。通过比较接受者操作特征曲线(AUC)下的面积和预定阈值下的点统计数据来评估模型的区分度。
研究共纳入 2020 年 3 月至 4 月期间的 1026 例患者。在此期间,研究人群的总体疾病患病率为 9.6%。当包含四个特征(暴露史、体温、白细胞计数(WBC)和胸部 X 线片结果)时,逻辑回归模型的 AUC 为 0.89(95%置信区间[CI]0.84-0.94)。随机森林方法的 AUC 为 0.86(95%CI 0.79-0.92),梯度提升的 AUC 为 0.85(95%CI 0.79-0.91)。在保持一致的阴性预测值的情况下,逻辑回归模型的阳性预测值为 0.29(0.2-0.39),而随机森林为 0.2(0.14-0.28),梯度提升模型为 0.22(0.15-0.3)。
这些预测模型具有良好的 COVID-19 疾病区分能力,为那些在社区和医疗系统的重要交叉点护理这些患者的提供者提供了可解释且可用的方法。我们发现,使用暴露史、体温、WBC 和胸部 X 射线结果的逻辑回归模型具有最大的区分能力和最具可解释性的模型。整合基于预测模型的 COVID-19 检测决策和患者护理路径和地点的方法可以提高效率和准确性,减少不确定性。