Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Department of Health and Human Services, Rockville, MD 20852, USA.
Proc Natl Acad Sci U S A. 2011 Nov 1;108(44):18026-31. doi: 10.1073/pnas.1114759108. Epub 2011 Oct 14.
Recent discoveries of hundreds of common susceptibility SNPs from genome-wide association studies provide a unique opportunity to examine population genetic models for complex traits. In this report, we investigate distributions of various population genetic parameters and their interrelationships using estimates of allele frequencies and effect-size parameters for about 400 susceptibility SNPs across a spectrum of qualitative and quantitative traits. We calibrate our analysis by statistical power for detection of SNPs to account for overrepresentation of variants with larger effect sizes in currently known SNPs that are expected due to statistical power for discovery. Across all qualitative disease traits, minor alleles conferred "risk" more often than "protection." Across all traits, an inverse relationship existed between "regression effects" and allele frequencies. Both of these trends were remarkably strong for type I diabetes, a trait that is most likely to be influenced by selection, but were modest for other traits such as human height or late-onset diseases such as type II diabetes and cancers. Across all traits, the estimated effect-size distribution suggested the existence of increasingly large numbers of susceptibility SNPs with decreasingly small effects. For most traits, the set of SNPs with intermediate minor allele frequencies (5-20%) contained an unusually small number of susceptibility loci and explained a relatively small fraction of heritability compared with what would be expected from the distribution of SNPs in the general population. These trends could have several implications for future studies of common and uncommon variants.
最近从全基因组关联研究中发现了数百个常见的易感性单核苷酸多态性(SNP),为研究复杂性状的群体遗传模型提供了独特的机会。在本报告中,我们使用约 400 个易感性 SNP 在各种定性和定量性状上的等位基因频率和效应大小参数估计值,研究了各种群体遗传参数的分布及其相互关系。我们通过对 SNP 检测的统计功效进行校准,以解释当前已知 SNP 中较大效应大小的变体的代表性过高,这是由于发现 SNP 的统计功效所致。在所有定性疾病性状中,次要等位基因更经常赋予“风险”而不是“保护”。在所有性状中,“回归效应”与等位基因频率之间存在反比关系。这两种趋势在 I 型糖尿病(一种最有可能受到选择影响的性状)中非常明显,但在其他性状(如人类身高)或迟发性疾病(如 II 型糖尿病和癌症)中则适中。在所有性状中,估计的效应大小分布表明,具有越来越小效应的易感性 SNP 的数量越来越多。对于大多数性状,中间频率的 SNP (5-20%)的 SNP 集合包含异常少的易感基因座,与从一般人群中 SNP 的分布中预期的相比,它们解释的遗传力相对较小。这些趋势可能对常见和罕见变体的未来研究有几种影响。