Section of Social Medicine, Department of Public Health, University of Copenhagen, Copenhagen, Denmark.
Department of Translational Type 1 Diabetes Research, Steno Diabetes Center Copenhagen, Herlev, Denmark.
Int J Epidemiol. 2024 Jun 12;53(4). doi: 10.1093/ije/dyae097.
Empirical evaluation of inverse probability weighting (IPW) for self-selection bias correction is inaccessible without the full source population. We aimed to: (i) investigate how self-selection biases frequency and association measures and (ii) assess self-selection bias correction using IPW in a cohort with register linkage.
The source population included 17 936 individuals invited to the Copenhagen Aging and Midlife Biobank during 2009-11 (ages 49-63 years). Participants counted 7185 (40.1%). Register data were obtained for every invited person from 7 years before invitation to the end of 2020. The association between education and mortality was estimated using Cox regression models among participants, IPW participants and the source population.
Participants had higher socioeconomic position and fewer hospital contacts before baseline than the source population. Frequency measures of participants approached those of the source population after IPW. Compared with primary/lower secondary education, upper secondary, short tertiary, bachelor and master/doctoral were associated with reduced risk of death among participants (adjusted hazard ratio [95% CI]: 0.60 [0.46; 0.77], 0.68 [0.42; 1.11], 0.37 [0.25; 0.54], 0.28 [0.18; 0.46], respectively). IPW changed the estimates marginally (0.59 [0.45; 0.77], 0.57 [0.34; 0.93], 0.34 [0.23; 0.50], 0.24 [0.15; 0.39]) but not only towards those of the source population (0.57 [0.51; 0.64], 0.43 [0.32; 0.60], 0.38 [0.32; 0.47], 0.22 [0.16; 0.29]).
Frequency measures of study participants may not reflect the source population in the presence of self-selection, but the impact on association measures can be limited. IPW may be useful for (self-)selection bias correction, but the returned results can still reflect residual or other biases and random errors.
如果没有完整的源人群,实证评估逆概率加权(Inverse Probability Weighting,简称 IPW)校正自选择偏倚是无法实现的。我们的目的是:(i)研究自选择偏倚如何影响频率和关联度量;(ii)使用 IPW 在具有登记链接的队列中评估自选择偏倚校正。
源人群包括 2009-11 年期间邀请参加哥本哈根老龄化和中年生物库的 17936 人(年龄 49-63 岁)。参与者有 7185 人(40.1%)。从邀请前 7 年到 2020 年底,为每个受邀者获取登记数据。在参与者、IPW 参与者和源人群中,使用 Cox 回归模型估计教育程度与死亡率之间的关联。
与源人群相比,参与者在基线前具有更高的社会经济地位和更少的住院接触。经过 IPW 后,参与者的频率测量值接近源人群。与初等/低等教育相比,中等教育、短期高等教育、学士学位和硕士/博士学位与参与者的死亡风险降低相关(调整后的危险比[95%CI]:0.60[0.46;0.77],0.68[0.42;1.11],0.37[0.25;0.54],0.28[0.18;0.46])。IPW 略微改变了估计值(0.59[0.45;0.77],0.57[0.34;0.93],0.34[0.23;0.50],0.24[0.15;0.39]),但并没有完全回归到源人群的估计值(0.57[0.51;0.64],0.43[0.32;0.60],0.38[0.32;0.47],0.22[0.16;0.29])。
在存在自选择的情况下,研究参与者的频率测量值可能无法反映源人群,但对关联测量值的影响可能有限。IPW 可能有助于(自)选择偏倚校正,但返回的结果仍可能反映残留或其他偏差和随机误差。