Usher Institute, College of Medicine and Veterinary Medicine, University of Edinburgh, Edinburgh, Scotland.
Public Health Scotland, Glasgow, Scotland.
PLoS Med. 2020 Oct 20;17(10):e1003374. doi: 10.1371/journal.pmed.1003374. eCollection 2020 Oct.
The objectives of this study were to identify risk factors for severe coronavirus disease 2019 (COVID-19) and to lay the basis for risk stratification based on demographic data and health records.
The design was a matched case-control study. Severe COVID-19 was defined as either a positive nucleic acid test for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in the national database followed by entry to a critical care unit or death within 28 days or a death certificate with COVID-19 as underlying cause. Up to 10 controls per case matched for sex, age, and primary care practice were selected from the national population register. For this analysis-based on ascertainment of positive test results up to 6 June 2020, entry to critical care up to 14 June 2020, and deaths registered up to 14 June 2020-there were 36,948 controls and 4,272 cases, of which 1,894 (44%) were care home residents. All diagnostic codes from the past 5 years of hospitalisation records and all drug codes from prescriptions dispensed during the past 240 days were extracted. Rate ratios for severe COVID-19 were estimated by conditional logistic regression. In a logistic regression using the age-sex distribution of the national population, the odds ratios for severe disease were 2.87 for a 10-year increase in age and 1.63 for male sex. In the case-control analysis, the strongest risk factor was residence in a care home, with rate ratio 21.4 (95% CI 19.1-23.9, p = 8 × 10-644). Univariate rate ratios for conditions listed by public health agencies as conferring high risk were 2.75 (95% CI 1.96-3.88, p = 6 × 10-9) for type 1 diabetes, 1.60 (95% CI 1.48-1.74, p = 8 × 10-30) for type 2 diabetes, 1.49 (95% CI 1.37-1.61, p = 3 × 10-21) for ischemic heart disease, 2.23 (95% CI 2.08-2.39, p = 4 × 10-109) for other heart disease, 1.96 (95% CI 1.83-2.10, p = 2 × 10-78) for chronic lower respiratory tract disease, 4.06 (95% CI 3.15-5.23, p = 3 × 10-27) for chronic kidney disease, 5.4 (95% CI 4.9-5.8, p = 1 × 10-354) for neurological disease, 3.61 (95% CI 2.60-5.00, p = 2 × 10-14) for chronic liver disease, and 2.66 (95% CI 1.86-3.79, p = 7 × 10-8) for immune deficiency or suppression. Seventy-eight percent of cases and 52% of controls had at least one listed condition (51% of cases and 11% of controls under age 40). Severe disease was associated with encashment of at least one prescription in the past 9 months and with at least one hospital admission in the past 5 years (rate ratios 3.10 [95% CI 2.59-3.71] and 2.75 [95% CI 2.53-2.99], respectively) even after adjusting for the listed conditions. In those without listed conditions, significant associations with severe disease were seen across many hospital diagnoses and drug categories. Age and sex provided 2.58 bits of information for discrimination. A model based on demographic variables, listed conditions, hospital diagnoses, and prescriptions provided an additional 1.07 bits (C-statistic 0.804). A limitation of this study is that records from primary care were not available.
We have shown that, along with older age and male sex, severe COVID-19 is strongly associated with past medical history across all age groups. Many comorbidities beyond the risk conditions designated by public health agencies contribute to this. A risk classifier that uses all the information available in health records, rather than only a limited set of conditions, will more accurately discriminate between low-risk and high-risk individuals who may require shielding until the epidemic is over.
本研究旨在确定严重新型冠状病毒病 2019(COVID-19)的危险因素,并为基于人口统计学数据和健康记录的风险分层奠定基础。
本研究设计为匹配的病例对照研究。严重 COVID-19 定义为国家数据库中严重急性呼吸综合征冠状病毒 2(SARS-CoV-2)核酸检测阳性,随后入住重症监护病房或 28 天内死亡,或死亡证明为 COVID-19 为根本死因。从国家人口登记处中为每个病例选择最多 10 名性别、年龄和初级保健实践相匹配的对照。基于截至 2020 年 6 月 6 日的阳性检测结果确定、截至 2020 年 6 月 14 日的入住重症监护病房和截至 2020 年 6 月 14 日登记的死亡情况,共纳入 36948 名对照和 4272 例病例,其中 1894 例(44%)为护理院居民。从过去 5 年的住院记录中提取所有诊断代码,以及过去 240 天内开具的所有药物代码。通过条件逻辑回归估计严重 COVID-19 的率比。在使用全国人口年龄性别分布的逻辑回归中,年龄每增加 10 岁,疾病严重程度的优势比为 2.87,男性为 1.63。在病例对照分析中,最强的危险因素是居住在护理院,率比为 21.4(95%CI 19.1-23.9,p=8×10-644)。公共卫生机构列为高风险的疾病条件的单变量率比为 1.96(95%CI 1.83-2.10,p=2×10-78);1 型糖尿病为 2.75(95%CI 1.96-3.88,p=6×10-9);2 型糖尿病为 1.60(95%CI 1.48-1.74,p=8×10-30);缺血性心脏病为 1.49(95%CI 1.37-1.61,p=3×10-21);其他心脏病为 2.23(95%CI 2.08-2.39,p=4×10-109);慢性下呼吸道疾病为 1.96(95%CI 1.83-2.10,p=2×10-78);慢性肾脏病为 4.06(95%CI 3.15-5.23,p=3×10-27);神经疾病为 5.4(95%CI 4.9-5.8,p=1×10-354);慢性肝病为 3.61(95%CI 2.60-5.00,p=2×10-14);免疫缺陷或抑制为 2.66(95%CI 1.86-3.79,p=7×10-8)。78%的病例和 52%的对照至少有一种列出的疾病(40 岁以下病例为 51%,对照为 11%)。在过去 9 个月内至少开出一种处方且在过去 5 年内至少有一次住院记录(率比分别为 3.10[95%CI 2.59-3.71]和 2.75[95%CI 2.53-2.99])与严重疾病相关,即使在调整列出的疾病后也是如此。在没有列出疾病的情况下,许多与医院诊断和药物类别相关的严重疾病也存在显著关联。年龄和性别为鉴别提供了 2.58 位信息。基于人口统计学变量、列出的疾病、医院诊断和处方的模型提供了另外 1.07 位信息(C 统计量 0.804)。本研究的一个局限性是初级保健记录不可用。
我们已经表明,除了年龄较大和男性外,严重 COVID-19 与所有年龄组的既往病史密切相关。公共卫生机构指定的风险条件之外的许多合并症也促成了这一点。使用健康记录中所有可用信息而不是仅使用有限的一组条件的风险分类器,将更准确地区分低风险和高风险个体,这些个体可能需要在疫情结束前进行隔离。