Department of Epidemiology, Colorado School of Public Health, Aurora, Colorado, USA.
Department of Orthopedics, Musculoskeletal Research Center, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA.
Genet Epidemiol. 2021 Sep;45(6):593-603. doi: 10.1002/gepi.22418. Epub 2021 Jun 15.
Omics studies frequently use samples collected during cohort studies. Conditioning on sample availability can cause selection bias if sample availability is nonrandom. Inverse probability weighting (IPW) is purported to reduce this bias. We evaluated IPW in an epigenome-wide analysis testing the association between DNA methylation (261,435 probes) and age in healthy adolescent subjects (n = 114). We simulated age and sex to be correlated with sample selection and then evaluated four conditions: complete population/no selection bias (all subjects), naïve selection bias (no adjustment), and IPW selection bias (selection bias with IPW adjustment). Assuming the complete population condition represented the "truth," we compared each condition to the complete population condition. Bias or difference in associations between age and methylation was reduced in the IPW condition versus the naïve condition. However, genomic inflation and type 1 error were higher in the IPW condition relative to the naïve condition. Postadjustment using bacon, type 1 error and inflation were similar across all conditions. Power was higher under the IPW condition compared with the naïve condition before and after inflation adjustment. IPW methods can reduce bias in genome-wide analyses. Genomic inflation is a potential concern that can be minimized using methods that adjust for inflation.
组学研究经常使用队列研究中收集的样本。如果样本的可用性不是随机的,那么基于样本可用性进行条件处理可能会导致选择偏差。逆概率加权(Inverse Probability Weighting,简称 IPW)据称可以减少这种偏差。我们在一项全基因组范围内的分析中评估了 IPW,该分析测试了 DNA 甲基化(261,435 个探针)与健康青少年受试者年龄(n=114)之间的关联。我们模拟了年龄和性别与样本选择相关,并评估了四种情况:完全人群/无选择偏差(所有受试者)、幼稚选择偏差(无调整)和 IPW 选择偏差(有 IPW 调整的选择偏差)。假设完全人群条件代表“真相”,我们将每种情况与完全人群条件进行比较。与幼稚情况相比,IPW 情况下年龄与甲基化之间的关联偏差或差异较小。然而,与幼稚情况相比,IPW 情况下的基因组膨胀和 1 型错误更高。在所有条件下,使用 bacon 进行后调整后,1 型错误和膨胀相似。在调整膨胀之前和之后,IPW 条件下的功效均高于幼稚条件。IPW 方法可以减少全基因组分析中的偏差。基因组膨胀是一个潜在的问题,可以通过调整膨胀的方法来最小化。