Learoyd Annastazia, Nicholas Jennifer, Hart Nicholas, Douiri Abdel
School of Life Course and Population Sciences, King College London, London, UK.
Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, UK.
BMC Med Res Methodol. 2024 Jul 16;24(1):149. doi: 10.1186/s12874-023-02129-7.
Throughout the Covid-19 pandemic, researchers have made use of electronic health records to research this disease in a rapidly evolving environment of questions and discoveries. These studies are prone to collider bias as they restrict the population of Covid-19 patients to only those with severe disease. Inverse probability weighting is typically used to correct for this bias but requires information from the unrestricted population. Using electronic health records from a South London NHS trust, this work demonstrates a method to correct for collider bias using externally sourced data while examining the relationship between minority ethnicities and poor Covid-19 outcomes.
The probability of inclusion within the observed hospitalised cohort was modelled based on estimates from published national data. The model described the relationship between patient ethnicity, hospitalisation, and death due to Covid-19 - a relationship suggested to be susceptible to collider bias. The obtained probabilities (as applied to the observed patient cohort) were used as inverse probability weights in survival analysis examining ethnicity (and covariates) as a risk factor for death due to Covid-19.
Within the observed cohort, unweighted analysis of survival suggested a reduced risk of death in those of Black ethnicity - differing from the published literature. Applying inverse probability weights to this analysis amended this aberrant result to one more compatible with the literature. This effect was consistent when the analysis was applied to patients within only the first wave of Covid-19 and across two waves of Covid-19 and was robust against adjustments to the modelled relationship between hospitalisation, patient ethnicity, and death due to Covid-19 made as part of a sensitivity analysis.
In conclusion, this analysis demonstrates the feasibility of using external publications to correct for collider bias (or other forms of selection bias) induced by the restriction of a population to a hospitalised cohort using an example from the recent Covid-19 pandemic.
在整个新冠疫情期间,研究人员利用电子健康记录在一个问题和发现迅速演变的环境中对这种疾病进行研究。这些研究容易出现对撞机偏差,因为它们将新冠患者群体限制为仅患有严重疾病的患者。逆概率加权通常用于校正这种偏差,但需要来自未受限群体的信息。利用伦敦南部一家国民保健服务信托机构的电子健康记录,这项工作展示了一种利用外部来源数据校正对撞机偏差的方法,同时研究少数族裔与不良新冠疫情结果之间的关系。
根据已发表的全国数据估计值,对观察到的住院队列中的纳入概率进行建模。该模型描述了患者种族、住院和新冠死亡之间的关系——这种关系被认为容易受到对撞机偏差的影响。将获得的概率(应用于观察到的患者队列)用作生存分析中的逆概率权重,以检验种族(和协变量)作为新冠死亡风险因素的情况。
在观察到的队列中,未加权的生存分析表明黑人种族的死亡风险降低——这与已发表的文献不同。对该分析应用逆概率权重将这一异常结果修正为与文献更相符的结果。当该分析仅应用于新冠疫情第一波期间的患者以及两波新冠疫情期间的患者时,这种效果是一致 的,并且在作为敏感性分析的一部分对住院、患者种族和新冠死亡之间的建模关系进行调整时具有稳健性。
总之,本分析通过最近新冠疫情的一个例子,证明了使用外部出版物校正因将人群限制在住院队列而导致的对撞机偏差(或其他形式的选择偏差)的可行性。