Cheng K F, Lin W J
Biostatistics Center and Department of Public Health, China Medical University, Taiwan, China.
Am J Hum Genet. 2007 Oct;81(4):726-43. doi: 10.1086/520962. Epub 2007 Aug 22.
In population-based case-control association studies, the regular chi (2) test is often used to investigate association between a candidate locus and disease. However, it is well known that this test may be biased in the presence of population stratification and/or genotyping error. Unlike some other biases, this bias will not go away with increasing sample size. On the contrary, the false-positive rate will be much larger when the sample size is increased. The usual family-based designs are robust against population stratification, but they are sensitive to genotype error. In this article, we propose a novel method of simultaneously correcting for the bias arising from population stratification and/or for the genotyping error in case-control studies. The appropriate corrections depend on sample odds ratios of the standard 2x3 tables of genotype by case and control from null loci. Therefore, the test is simple to apply. The corrected test is robust against misspecification of the genetic model. If the null hypothesis of no association is rejected, the corrections can be further used to estimate the effect of the genetic factor. We considered a simulation study to investigate the performance of the new method, using parameter values similar to those found in real-data examples. The results show that the corrected test approximately maintains the expected type I error rate under various simulation conditions. It also improves the power of the association test in the presence of population stratification and/or genotyping error. The discrepancy in power between the tests with correction and those without correction tends to be more extreme as the magnitude of the bias becomes larger. Therefore, the bias-correction method proposed in this article should be useful for the genetic analysis of complex traits.
在基于人群的病例对照关联研究中,常使用常规的卡方检验来研究候选基因座与疾病之间的关联。然而,众所周知,在存在人群分层和/或基因分型错误的情况下,该检验可能会产生偏差。与其他一些偏差不同,这种偏差不会随着样本量的增加而消失。相反,当样本量增加时,假阳性率会变得更大。通常基于家系的设计对人群分层具有稳健性,但它们对基因分型错误敏感。在本文中,我们提出了一种新方法,可同时校正病例对照研究中因人群分层产生的偏差和/或基因分型错误。适当的校正取决于来自无效基因座的病例和对照的基因型标准2x3表的样本优势比。因此,该检验易于应用。校正后的检验对遗传模型的错误设定具有稳健性。如果拒绝了无关联的零假设,则校正可进一步用于估计遗传因素的效应。我们考虑进行一项模拟研究,使用与实际数据示例中发现的参数值相似的参数值来研究新方法的性能。结果表明,校正后的检验在各种模拟条件下大致保持了预期的I型错误率。它还提高了在存在人群分层和/或基因分型错误时关联检验的效能。随着偏差幅度变大,校正检验与未校正检验之间的效能差异往往会更加极端。因此,本文提出的偏差校正方法应该对复杂性状的遗传分析有用。