Reich David E, Schaffner Stephen F, Daly Mark J, McVean Gil, Mullikin James C, Higgins John M, Richter Daniel J, Lander Eric S, Altshuler David
Whitehead Institute/MIT Center for Genome Research, One Kendall Square, Cambridge, Massachusetts 02139, USA.
Nat Genet. 2002 Sep;32(1):135-42. doi: 10.1038/ng947. Epub 2002 Aug 5.
Variation in the human genome sequence is key to understanding susceptibility to disease in modern populations and the history of ancestral populations. Unlocking this information requires knowledge of the patterns and underlying causes of human sequence diversity. By applying a new population-genetic framework to two genome-wide polymorphism surveys, we find that the human genome contains sizeable regions (stretching over tens of thousands of base pairs) that have intrinsically high and low rates of sequence variation. We show that the primary determinant of these patterns is shared genealogical history. Only a fraction of the variation (at most 25%) is due to the local mutation rate. By measuring the average distance over which genealogical histories are typically preserved, these data provide the first genome-wide estimate of the average extent of correlation among variants (linkage disequilibrium). The results are best explained by extreme variability in the recombination rate at a fine scale, and provide the first empirical evidence that such recombination 'hot spots' are a general feature of the human genome and have a principal role in shaping genetic variation in the human population.
人类基因组序列的变异是理解现代人群疾病易感性和祖先人群历史的关键。要解开这些信息,需要了解人类序列多样性的模式和潜在原因。通过将一种新的群体遗传学框架应用于两项全基因组多态性调查,我们发现人类基因组包含相当大的区域(延伸数万碱基对),其序列变异率本质上有高有低。我们表明,这些模式的主要决定因素是共同的系谱历史。只有一小部分变异(最多25%)是由于局部突变率。通过测量系谱历史通常得以保留的平均距离,这些数据提供了全基因组范围内对变异间关联平均程度(连锁不平衡)的首次估计。结果最好用精细尺度上重组率的极端变异性来解释,并提供了首个实证证据,表明这种重组“热点”是人类基因组的一个普遍特征,并且在塑造人类群体的遗传变异中起主要作用。