Division of Biostatistics, New York University, School of Medicine, New York, New York, USA.
Genet Epidemiol. 2010 Dec;34(8):792-802. doi: 10.1002/gepi.20523.
With the advent of dense single nucleotide polymorphism genotyping, population-based association studies have become the major tools for identifying human disease genes and for fine gene mapping of complex traits. We develop a genotype-based approach for association analysis of case-control studies of gene-environment interactions in the case when environmental factors are measured with error and genotype data are available on multiple genetic markers. To directly use the observed genotype data, we propose two genotype-based models: genotype effect and additive effect models. Our approach offers several advantages. First, the proposed risk functions can directly incorporate the observed genotype data while modeling the linkage disequilibrium information in the regression coefficients, thus eliminating the need to infer haplotype phase. Compared with the haplotype-based approach, an estimating procedure based on the proposed methods can be much simpler and significantly faster. In addition, there is no potential risk due to haplotype phase estimation. Further, by fitting the proposed models, it is possible to analyze the risk alleles/variants of complex diseases, including their dominant or additive effects. To model measurement error, we adopt the pseudo-likelihood method by Lobach et al. [2008]. Performance of the proposed method is examined using simulation experiments. An application of our method is illustrated using a population-based case-control study of association between calcium intake with the risk of colorectal adenoma development.
随着密集单核苷酸多态性基因分型的出现,基于人群的关联研究已成为鉴定人类疾病基因和复杂性状精细基因定位的主要工具。当环境因素存在测量误差且存在多个遗传标记的基因型数据时,我们开发了一种基于基因型的方法,用于基因-环境相互作用的病例对照研究的关联分析。为了直接使用观察到的基因型数据,我们提出了两种基于基因型的模型:基因型效应和加性效应模型。我们的方法具有几个优点。首先,所提出的风险函数可以直接整合观察到的基因型数据,同时对回归系数中的连锁不平衡信息进行建模,从而消除了推断单倍型相位的需要。与基于单倍型的方法相比,基于所提出的方法的估计过程可以更简单且显著更快。此外,由于单倍型相位估计,不存在潜在风险。此外,通过拟合所提出的模型,可以分析复杂疾病的风险等位基因/变体,包括它们的显性或加性效应。为了模拟测量误差,我们采用了 Lobach 等人[2008]提出的伪似然方法。通过模拟实验检验了所提出方法的性能。我们的方法的一个应用是使用基于人群的病例对照研究,研究钙摄入量与结直肠腺瘤发展风险之间的关联。