Wang Yiting, Localio Russell, Rebbeck Timothy R
Department of Biostatistics and Epidemiology, University of Pennsylvania School of Medicine, 904 Blockley Hall, 423 Guardian Drive, Philadelphia, PA 19104-6021, USA.
Cancer Epidemiol Biomarkers Prev. 2006 Jan;15(1):124-32. doi: 10.1158/1055-9965.EPI-05-0304.
Confounding by ethnicity (i.e. population stratification) can result in bias and incorrect inferences in genotype-disease association studies, but the effect of population stratification in gene-gene or gene-environment interaction studies has not been addressed. We used logistic regression models to fit multiplicative interactions between two dichotomous variables that represented genetic and/or environmental factors for a binary disease outcome in a hypothetical cohort of multiple ethnicities. Biases in main effects and interactions due to population stratification were evaluated by comparing regression coefficients in mis-specified models that ignored ethnicities with their counterparts in models that accounted for ethnicities. We showed that biases in main effects and interactions were constrained by the differences in disease risks across the ethnicities. Therefore, large biases due to population stratification are not possible when baseline disease risk differences among ethnicities are small or moderate. Numerical examples of biases in genotype-genotype and/or genotype-environment interactions suggested that biases due to population stratification for main effects were generally small but could become large for studies of interactions, particularly when strong linkage disequilibrium between genes or large correlations between genetic and environmental factors existed. However, when linkage disequilibrium among genes or correlations among genes and environments were small, biases to main effects or interaction odds ratios were small to nonexistent.
种族因素导致的混杂(即群体分层)会在基因-疾病关联研究中产生偏差和错误推断,但群体分层在基因-基因或基因-环境相互作用研究中的影响尚未得到探讨。我们使用逻辑回归模型来拟合两个二分变量之间的乘法相互作用,这两个变量代表了一个多民族假设队列中二元疾病结局的遗传和/或环境因素。通过比较忽略种族的错误设定模型中的回归系数与考虑种族的模型中的回归系数,评估了群体分层对主效应和相互作用的偏差。我们表明,主效应和相互作用中的偏差受到不同种族疾病风险差异的限制。因此,当不同种族之间的基线疾病风险差异较小或适中时,群体分层导致的大偏差是不可能出现的。基因型-基因型和/或基因型-环境相互作用偏差的数值示例表明,群体分层对主效应的偏差通常较小,但在相互作用研究中可能会变得很大,特别是当基因之间存在强连锁不平衡或遗传与环境因素之间存在高度相关性时。然而,当基因之间的连锁不平衡或基因与环境之间的相关性较小时,对主效应或相互作用优势比的偏差很小或不存在。