Inverse probability weighting is an effective method to address selection bias during the analysis of high dimensional data.

Affiliations

Department of Epidemiology, Colorado School of Public Health, Aurora, Colorado, USA.

Department of Orthopedics, Musculoskeletal Research Center, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA.

Publication information

Genet Epidemiol. 2021 Sep;45(6):593-603. doi: 10.1002/gepi.22418. Epub 2021 Jun 15.

Abstract

Omics studies frequently use samples collected during cohort studies. Conditioning on sample availability can cause selection bias if sample availability is nonrandom. Inverse probability weighting (IPW) is purported to reduce this bias. We evaluated IPW in an epigenome-wide analysis testing the association between DNA methylation (261,435 probes) and age in healthy adolescent subjects (n = 114). We simulated age and sex to be correlated with sample selection and then evaluated four conditions: complete population/no selection bias (all subjects), naïve selection bias (no adjustment), and IPW selection bias (selection bias with IPW adjustment). Assuming the complete population condition represented the "truth," we compared each condition to the complete population condition. Bias or difference in associations between age and methylation was reduced in the IPW condition versus the naïve condition. However, genomic inflation and type 1 error were higher in the IPW condition relative to the naïve condition. Postadjustment using bacon, type 1 error and inflation were similar across all conditions. Power was higher under the IPW condition compared with the naïve condition before and after inflation adjustment. IPW methods can reduce bias in genome-wide analyses. Genomic inflation is a potential concern that can be minimized using methods that adjust for inflation.
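
The weighting idea in the abstract can be illustrated with a small simulation. The following is a minimal sketch under stated assumptions, not the authors' analysis: the abstract does not say how the selection probabilities were estimated, so the logistic selection model, the simulated outcome, the effect sizes, and the helper names (`fit_logistic`, `weighted_slope`, `run_simulation`) are all hypothetical. It simulates a single outcome whose age slope differs by sex, makes sample availability depend on age and sex, and compares the age slope in the full population, in the selected sample without adjustment (naive), and in the selected sample weighted by the inverse of the estimated selection probability (IPW).

```python
import numpy as np


def fit_logistic(X, y, n_iter=25):
    """Fit a logistic regression by Newton-Raphson; returns the coefficients."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))
        gradient = X.T @ (y - p)
        hessian = (X * (p * (1.0 - p))[:, None]).T @ X
        beta += np.linalg.solve(hessian, gradient)
    return beta


def weighted_slope(x, y, w):
    """Slope of y on x from weighted least squares with an intercept."""
    X = np.column_stack([np.ones_like(x), x])
    XtW = X.T * w  # scale each observation by its weight
    return np.linalg.solve(XtW @ X, XtW @ y)[1]


def run_simulation(n=20000, seed=0):
    rng = np.random.default_rng(seed)
    age = rng.uniform(10, 18, n)
    sex = rng.integers(0, 2, n).astype(float)
    age_c = age - 14.0  # centered age

    # Hypothetical outcome: the age slope is 0.01 for sex=0 and 0.05 for sex=1.
    y = 0.5 + 0.01 * age_c + 0.04 * sex * age_c + rng.normal(0, 0.1, n)

    # Assumed selection mechanism: availability depends on age and sex.
    p_true = 1.0 / (1.0 + np.exp(-(-1.0 + 0.15 * age_c + 1.5 * sex)))
    selected = rng.random(n) < p_true

    # Estimate selection probabilities from covariates observed for everyone,
    # then weight each selected subject by 1 / P(selected | age, sex).
    X_sel = np.column_stack([np.ones(n), age_c, sex])
    beta = fit_logistic(X_sel, selected.astype(float))
    p_hat = 1.0 / (1.0 + np.exp(-X_sel @ beta))
    weights = 1.0 / p_hat[selected]

    n_sel = int(selected.sum())
    return {
        "full": weighted_slope(age, y, np.ones(n)),
        "naive": weighted_slope(age[selected], y[selected], np.ones(n_sel)),
        "ipw": weighted_slope(age[selected], y[selected], weights),
    }


if __name__ == "__main__":
    for name, slope in run_simulation().items():
        print(f"{name:>5}: age slope = {slope:.4f}")
```

In this toy setup the naive slope drifts toward the slope of the over-selected sex, while the weighted estimate should sit closer to the full-population value. The sketch covers only the bias-reduction step; it does not reproduce the genomic inflation or type 1 error findings, which concern test statistics across many probes.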

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5125/8376760/d066b756439e/nihms-1709983-f0001.jpg
