Dr. von Hauner Children's Hospital, Ludwig Maximilians University Munich, 80337, Munich, Germany,
Eur J Epidemiol. 2013 Oct;28(10):785-97. doi: 10.1007/s10654-013-9837-4. Epub 2013 Sep 5.
Any genome-wide analysis is hampered by reduced statistical power due to multiple comparisons. This is particularly true for interaction analyses, which have lower statistical power than analyses of associations. To assess gene-environment interactions in population settings we have recently proposed a statistical method based on a modified two-step approach, where first genetic loci are selected by their associations with disease and environment, respectively, and subsequently tested for interactions. We have simulated various data sets resembling real world scenarios and compared single-step and two-step approaches with respect to true positive rate (TPR) in 486 scenarios and (study-wide) false positive rate (FPR) in 252 scenarios. Our simulations confirmed that in all two-step methods the two steps are not correlated. In terms of TPR, two-step approaches combining information on gene-disease association and gene-environment association in the first step were superior to all other methods, while preserving a low FPR in over 250 million simulations under the null hypothesis. Our weighted modification yielded the highest power across various degrees of gene-environment association in the controls. An optimal threshold for step 1 depended on the interacting allele frequency and the disease prevalence. In all scenarios, the least powerful method was to proceed directly to an unbiased full interaction model, applying conventional genome-wide significance thresholds. This simulation study confirms the practical advantage of two-step approaches to interaction testing over more conventional one-step designs, at least in the context of dichotomous disease outcomes and other parameters that might apply in real-world settings.
全基因组分析由于多次比较而受到统计能力降低的限制。这对于相互作用分析尤其如此,其统计能力低于关联分析。为了在人群环境中评估基因-环境相互作用,我们最近提出了一种基于两步法改进的统计方法,其中首先分别根据与疾病和环境的关联选择遗传基因座,然后测试相互作用。我们模拟了各种类似于真实世界情况的数据,并在 486 种情况下比较了单步和两步方法的真阳性率(TPR),在 252 种情况下比较了(研究范围)假阳性率(FPR)。我们的模拟结果证实,在所有两步法中,两步之间没有相关性。在 TPR 方面,在第一步中结合基因疾病关联和基因环境关联信息的两步法优于所有其他方法,同时在零假设下,在超过 2.5 亿次模拟中保持低 FPR。我们的加权修改在控制组中各种程度的基因-环境关联中都具有最高的功效。第一步的最优阈值取决于相互作用等位基因频率和疾病流行率。在所有情况下,最不强大的方法是直接进行无偏全相互作用模型,应用常规的全基因组显著水平阈值。这项模拟研究证实,与更传统的一步设计相比,两步法在交互测试方面具有实际优势,至少在二项式疾病结局和其他可能适用于实际环境的参数的情况下是如此。