International Agency for Research on Cancer, Lyon, France.
Genet Epidemiol. 2010 Jan;34(1):16-25. doi: 10.1002/gepi.20429.
With the advent of rapid and relatively cheap genotyping technologies there is now the opportunity to attempt to identify gene-environment and gene-gene interactions when the number of genes and environmental factors is potentially large. Unfortunately the dimensionality of the parameter space leads to a computational explosion in the number of possible interactions that may be investigated. The full model that includes all interactions and main effects can be unstable, with wide confidence intervals arising from the large number of estimated parameters. We describe a hierarchical mixture model that allows all interactions to be investigated simultaneously, but assumes the effects come from a mixture prior with two components, one that reflects small null effects and the second for epidemiologically significant effects. Effects from the former are effectively set to zero, hence increasing the power for the detection of real signals. The prior framework is very flexible, which allows substantive information to be incorporated into the analysis. We illustrate the methods first using simulation, and then on data from a case-control study of lung cancer in Central and Eastern Europe.
随着快速且相对廉价的基因分型技术的出现,现在有机会尝试在基因和环境因素数量较多的情况下,确定基因-环境和基因-基因相互作用。不幸的是,参数空间的维度导致可能研究的相互作用数量呈计算爆炸式增长。包含所有相互作用和主效应的完整模型可能不稳定,由于估计参数数量众多,置信区间较宽。我们描述了一种层次混合模型,该模型允许同时研究所有相互作用,但假设效应来自具有两个分量的混合先验,一个反映小的零效应,第二个反映具有流行病学意义的效应。前者的效应实际上被设置为零,从而提高了检测真实信号的能力。该先验框架非常灵活,允许将实质性信息纳入分析中。我们首先使用模拟来说明这些方法,然后再使用中欧和东欧肺癌病例对照研究的数据进行说明。