Wang Jian, Shete Sanjay
Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA.
Ann Hum Genet. 2012 Nov;76(6):484-99. doi: 10.1111/j.1469-1809.2012.00725.x. Epub 2012 Aug 10.
A genome-wide association (GWA) study is usually designed as a case-control study, where the presence and absence of the primary disease define the cases and controls, respectively. Using the existing data from GWA studies, investigators are also trying to identify the association between genetic variants and secondary phenotypes, which are defined as traits associated with the primary disease. However, recent studies have shown that bias arises in the estimation of marker-secondary phenotype association using originally collected data. We recently proposed a bias correction approach to accurately estimate the odds ratio (OR) for marker-secondary phenotype association. In this communication, we further investigated whether our bias correction approach is robust for a scenario involving the interactive effect of the secondary phenotype and genetic variants on the primary disease. We found that in such a scenario, our bias correction approach also provides an accurate estimation of OR for marker-secondary phenotype association. We investigated accuracy of our approach using simulation studies and showed that the approach better controlled for type I errors than the existing approaches. We also applied our bias correction approach to the real data analysis of association between an N-acetyltransferase gene, NAT2, and smoking on the basis of colorectal adenoma data.
全基因组关联(GWA)研究通常设计为病例对照研究,其中原发性疾病的存在和不存在分别定义病例组和对照组。利用GWA研究的现有数据,研究人员也在试图确定基因变异与次要表型之间的关联,次要表型被定义为与原发性疾病相关的特征。然而,最近的研究表明,使用最初收集的数据估计标记物与次要表型的关联时会出现偏差。我们最近提出了一种偏差校正方法,以准确估计标记物与次要表型关联的比值比(OR)。在本通讯中,我们进一步研究了我们的偏差校正方法在涉及次要表型和基因变异对原发性疾病的交互作用的情况下是否稳健。我们发现,在这种情况下,我们的偏差校正方法也能准确估计标记物与次要表型关联的OR。我们通过模拟研究调查了我们方法的准确性,结果表明该方法比现有方法能更好地控制I型错误。我们还将我们的偏差校正方法应用于基于结直肠腺瘤数据的N-乙酰转移酶基因NAT2与吸烟之间关联的实际数据分析。