School of Life Sciences, Arizona State University, Tempe, AZ, United States.
Evolution. 2023 Oct 3;77(10):2113-2127. doi: 10.1093/evolut/qpad120.
The detection of selective sweeps from population genomic data often relies on the premise that the beneficial mutations in question have fixed very near the sampling time. As it has been previously shown that the power to detect a selective sweep is strongly dependent on the time since fixation as well as the strength of selection, it is naturally the case that strong, recent sweeps leave the strongest signatures. However, the biological reality is that beneficial mutations enter populations at a rate, one that partially determines the mean wait time between sweep events and hence their age distribution. An important question thus remains about the power to detect recurrent selective sweeps when they are modeled by a realistic mutation rate and as part of a realistic distribution of fitness effects, as opposed to a single, recent, isolated event on a purely neutral background as is more commonly modeled. Here we use forward-in-time simulations to study the performance of commonly used sweep statistics, within the context of more realistic evolutionary baseline models incorporating purifying and background selection, population size change, and mutation and recombination rate heterogeneity. Results demonstrate the important interplay of these processes, necessitating caution when interpreting selection scans; specifically, false-positive rates are in excess of true-positive across much of the evaluated parameter space, and selective sweeps are often undetectable unless the strength of selection is exceptionally strong.
从群体基因组数据中检测到选择清除(selective sweeps)通常依赖于一个前提,即所涉及的有益突变已经非常接近采样时间固定下来。因为先前已经表明,检测选择清除的能力强烈依赖于固定时间以及选择强度,所以强的、最近的清除会留下最强的特征。然而,生物现实是有益突变以一定的速率进入种群,该速率部分决定了清除事件之间的平均等待时间,从而决定了它们的年龄分布。因此,当它们由现实的突变率建模,并作为适应度效应的现实分布的一部分,而不是通常建模的纯中性背景上的单个最近孤立事件时,仍然存在一个重要的问题,即当它们作为现实的突变率和适应度效应分布的一部分被建模时,检测复发性选择清除的能力如何。在这里,我们使用向前时间模拟,在包含纯化和背景选择、种群大小变化以及突变和重组率异质性的更现实的进化基线模型的背景下,研究常用清除统计数据的性能。结果表明,这些过程之间存在重要的相互作用,在解释选择扫描时需要谨慎;具体来说,在评估的参数空间的大部分范围内,假阳性率超过了真阳性率,除非选择强度异常强,否则选择清除通常是不可检测的。