Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA.
Medical Scientist Training Program, Vanderbilt University School of Medicine, Nashville, TN, USA.
J Biomed Inform. 2021 May;117:103748. doi: 10.1016/j.jbi.2021.103748. Epub 2021 Mar 25.
Identifying symptoms and characteristics highly specific to coronavirus disease 2019 (COVID-19) would improve the clinical and public health response to this pandemic challenge. Here, we describe a high-throughput approach - Concept-Wide Association Study (ConceptWAS) - that systematically scans a disease's clinical manifestations from clinical notes. We used this method to identify symptoms specific to COVID-19 early in the course of the pandemic.
We created a natural language processing pipeline to extract concepts from clinical notes in a local ER corresponding to the PCR testing date for patients who had a COVID-19 test and evaluated these concepts as predictors for developing COVID-19. We identified predictors from Firth's logistic regression adjusted by age, gender, and race. We also performed ConceptWAS using cumulative data every two weeks to identify the timeline for recognition of early COVID-19-specific symptoms.
We processed 87,753 notes from 19,692 patients subjected to COVID-19 PCR testing between March 8, 2020, and May 27, 2020 (1,483 COVID-19-positive). We found 68 concepts significantly associated with a positive COVID-19 test. We identified symptoms associated with increasing risk of COVID-19, including "anosmia" (odds ratio [OR] = 4.97, 95% confidence interval [CI] = 3.21-7.50), "fever" (OR = 1.43, 95% CI = 1.28-1.59), "cough with fever" (OR = 2.29, 95% CI = 1.75-2.96), and "ageusia" (OR = 5.18, 95% CI = 3.02-8.58). Using ConceptWAS, we were able to detect loss of smell and loss of taste three weeks prior to their inclusion as symptoms of the disease by the Centers for Disease Control and Prevention (CDC).
ConceptWAS, a high-throughput approach for exploring specific symptoms and characteristics of a disease like COVID-19, offers a promise for enabling EHR-powered early disease manifestations identification.
识别与 2019 年冠状病毒病(COVID-19)高度特异的症状和特征将改善对这一流行疾病挑战的临床和公共卫生应对。在这里,我们描述了一种高通量方法——概念广泛关联研究(ConceptWAS)——它系统地从临床记录中扫描疾病的临床表现。我们使用这种方法在大流行早期识别 COVID-19 的特定症状。
我们创建了一个自然语言处理管道,从当地急诊室与患者 COVID-19 检测 PCR 检测日期相对应的临床记录中提取概念,并将这些概念评估为发展 COVID-19 的预测因子。我们通过 Firth 逻辑回归调整年龄、性别和种族来识别预测因子。我们还使用每两周累积的数据进行 ConceptWAS,以识别识别早期 COVID-19 特定症状的时间表。
我们处理了 19692 名接受 COVID-19 PCR 检测的患者在 2020 年 3 月 8 日至 2020 年 5 月 27 日之间的 87753 份记录(1483 份 COVID-19 阳性)。我们发现了 68 个与 COVID-19 阳性检测显著相关的概念。我们确定了与 COVID-19 风险增加相关的症状,包括“嗅觉丧失”(优势比[OR] = 4.97,95%置信区间[CI] = 3.21-7.50)、“发热”(OR = 1.43,95%CI = 1.28-1.59)、“发热咳嗽”(OR = 2.29,95%CI = 1.75-2.96)和“味觉丧失”(OR = 5.18,95%CI = 3.02-8.58)。使用 ConceptWAS,我们能够在疾病预防控制中心(CDC)将嗅觉丧失和味觉丧失纳入疾病症状之前的三周检测到它们。
ConceptWAS 是一种探索 COVID-19 等疾病特定症状和特征的高通量方法,为利用电子健康记录识别疾病的早期表现提供了一种可能。