Payami Haydeh, Zhu Motao, Montimurro Jennifer, Keefe Robert, McCulloch Colin C, Moses Lina
New York State Department of Health Wadsworth Center, Genomics Institute, P.O. Box 22002, 12201-2002, Albany, NY, USA.
Hum Genet. 2005 Dec;118(3-4):322-30. doi: 10.1007/s00439-005-0057-1. Epub 2005 Sep 28.
Association studies are the most powerful method available for identifying modest gene effects in complex disorders, but they often produce inconsistent results. With the rapidly growing SNP databases, haplotype maps and high throughput genotyping, the use of association studies is expected to increase; therefore, it is critical and timely that the problems with study design are identified and fixed. We questioned if unrecognized allele and genotype frequency variations in controls could be responsible for some of the inconsistent association findings. We performed a population genetic study of apolipoprotein E (APOE) and cytochrome P450 2D6 (CYP2D6) in 1,748 individuals ranging in age from newborns to centenarians. Although APOE and CYP2D6 are two of the most commonly used candidate genes, this is the first study to examine age- and gender-specific frequency distributions over the entire age spectrum, using a large, ethnically and geographically uniform population. We found significant, previously unrecognized variations in APOE allele frequencies, and deviations from Hardy-Weinberg expectations in CYP2D6 genotype frequencies starting at birth. The allele frequency variations within controls were larger than some reported case-control differences. We demonstrate that unrecognized frequency fluctuations in controls are a serious and potentially common confounder whose impact on association studies has not been appreciated, and one that can be addressed with proper study design. We recommend that population genetic studies be performed on commonly used candidate markers and that rigorous standards be applied for case-control matching.
关联研究是识别复杂疾病中微小基因效应的最有效方法,但这些研究结果往往并不一致。随着单核苷酸多态性(SNP)数据库、单倍型图谱和高通量基因分型技术的迅速发展,关联研究的应用有望增加;因此,识别并解决研究设计中的问题至关重要且迫在眉睫。我们质疑对照人群中未被识别的等位基因和基因型频率变化是否可能是导致某些不一致关联研究结果的原因。我们对1748名年龄从新生儿到百岁老人的个体进行了载脂蛋白E(APOE)和细胞色素P450 2D6(CYP2D6)的群体遗传学研究。尽管APOE和CYP2D6是最常用的两个候选基因,但这是首次使用一个种族和地域均一的大样本人群,在整个年龄范围内研究年龄和性别特异性频率分布的研究。我们发现APOE等位基因频率存在显著的、此前未被识别的变化,并且从出生起CYP2D6基因型频率就偏离了哈迪-温伯格平衡预期。对照人群中等位基因频率的变化大于一些已报道的病例对照差异。我们证明对照人群中未被识别的频率波动是一个严重且可能普遍存在的混杂因素,其对关联研究的影响尚未得到重视,并且可以通过适当的研究设计来解决。我们建议对常用的候选标记进行群体遗传学研究,并对病例对照匹配应用严格的标准。