Laboratory of Genetics, University of Wisconsin, USA.
Mol Biol Evol. 2013 Feb;30(2):285-98. doi: 10.1093/molbev/mss247. Epub 2012 Oct 27.
The ability to survey polymorphism on a genomic scale has enabled genome-wide scans for the targets of natural selection. Theory that connects patterns of genetic variation to evidence of natural selection most often assumes a diallelic locus and no recurrent mutation. Although these assumptions are suitable to selection that targets single nucleotide variants, fundamentally different types of mutation generate abundant polymorphism in genomes. Moreover, recent empirical results suggest that mutationally complex, multiallelic loci including microsatellites and copy number variants are sometimes targeted by natural selection. Given their abundance, the lack of inference methods tailored to the mutational peculiarities of these types of loci represents a notable gap in our ability to interrogate genomes for signatures of natural selection. Previous theoretical investigations of mutation-selection balance at multiallelic loci include assumptions that limit their application to inference from empirical data. Focusing on microsatellites, we assess the dynamics and population-level consequences of selection targeting mutationally complex variants. We develop general models of a multiallelic fitness surface, a realistic model of microsatellite mutation, and an efficient simulation algorithm. Using these tools, we explore mutation-selection-drift equilibrium at microsatellites and investigate the mutational history and selective regime of the microsatellite that causes Friedreich's ataxia. We characterize microsatellite selective events by their duration and cost, note similarities to sweeps from standing point variation, and conclude that it is premature to label microsatellites as ubiquitous agents of efficient adaptive change. Together, our models and simulation algorithm provide a powerful framework for statistical inference, which can be used to test the neutrality of microsatellites and other multiallelic variants.
大规模调查多态性的能力使人们能够对自然选择的目标进行全基因组扫描。将遗传变异模式与自然选择证据联系起来的理论通常假设存在二态性位点且没有反复突变。尽管这些假设适用于以单核苷酸变体为目标的选择,但产生大量多态性的突变类型根本不同。此外,最近的实证结果表明,突变复杂的多等位基因座,包括微卫星和拷贝数变异,有时是自然选择的目标。鉴于它们的丰富性,缺乏针对这些类型的突变特征量身定制的推断方法是我们探究基因组中自然选择特征的能力的一个显著差距。多等位基因座突变-选择平衡的先前理论研究包括了限制其从经验数据推断应用的假设。我们专注于微卫星,评估针对突变复杂变体的选择的动态和群体水平后果。我们开发了多等位基因适合度曲面的一般模型、微卫星突变的现实模型和高效的模拟算法。使用这些工具,我们探索了微卫星的突变-选择-漂变平衡,并研究了导致弗里德里希共济失调的微卫星的突变历史和选择机制。我们通过其持续时间和成本来描述微卫星的选择事件,注意到与来自固定点变异的扫荡的相似之处,并得出结论,将微卫星标记为有效的适应性变化的普遍因素还为时过早。总的来说,我们的模型和模拟算法为统计推断提供了一个强大的框架,可以用于测试微卫星和其他多等位基因变体的中性。