Mathieson Iain, Reich David
Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America.
Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America.
PLoS Genet. 2017 Feb 1;13(2):e1006581. doi: 10.1371/journal.pgen.1006581. eCollection 2017 Feb.
Mutations occur at vastly different rates across the genome, and populations, leading to differences in the spectrum of segregating polymorphisms. Here, we investigate variation in the rare variant spectrum in a sample of human genomes representing all major world populations. We find at least two distinct signatures of variation. One, consistent with a previously reported signature is characterized by an increased rate of TCC>TTC mutations in people from Western Eurasia and South Asia, likely related to differences in the rate, or efficiency of repair, of damage due to deamination of methylated guanine. We describe the geographic extent of this signature and show that it is detectable in the genomes of ancient, but not archaic humans. The second signature is private to certain Native American populations, and is concentrated at CpG sites. We show that this signature is not driven by differences in the CpG mutation rate, but is a result of the fact that highly mutable CpG sites are more likely to undergo multiple independent mutations across human populations, and the spectrum of such mutations is highly sensitive to recent demography. Both of these effects dramatically affect the spectrum of rare variants across human populations, and should be taken into account when using mutational clocks to make inference about demography.
突变在整个基因组和不同人群中的发生速率差异极大,导致分离多态性的频谱存在差异。在此,我们在一个代表世界所有主要人群的人类基因组样本中研究稀有变异频谱的变化。我们发现至少有两种不同的变异特征。其一,与先前报道的特征一致,其特点是来自西欧亚大陆和南亚的人群中TCC>TTC突变率增加,这可能与甲基化鸟嘌呤脱氨导致的损伤修复速率或效率差异有关。我们描述了这一特征的地理范围,并表明它在古代人类而非古人类的基因组中可检测到。第二个特征是某些美洲原住民群体所特有的,且集中在CpG位点。我们表明这一特征并非由CpG突变率的差异驱动,而是由于高度可变的CpG位点在不同人群中更有可能经历多次独立突变,并且此类突变的频谱对近期人口统计学极为敏感。这两种效应都极大地影响了整个人类群体中稀有变异的频谱,在使用突变时钟推断人口统计学信息时应予以考虑。