Biodesign Center for Mechanisms of Evolution, Arizona State University.
Genome Biol Evol. 2020 Apr 1;12(4):443-455. doi: 10.1093/gbe/evaa056.
The ability to obtain genome-wide sequences of very large numbers of individuals from natural populations raises questions about optimal sampling designs and the limits to extracting information on key population-genetic parameters from temporal-survey data. Methods are introduced for evaluating whether observed temporal fluctuations in allele frequencies are consistent with the hypothesis of random genetic drift, and expressions for the expected sampling variances for the relevant statistics are given in terms of sample sizes and numbers. Estimation methods and aspects of statistical reliability are also presented for the mean and temporal variance of selection coefficients. For nucleotide sites that pass the test of neutrality, the current effective population size can be estimated by a method of moments, and expressions for its sampling variance provide insight into the degree to which such methodology can yield meaningful results under alternative sampling schemes. Finally, some caveats are raised regarding the use of the temporal covariance of allele-frequency change to infer selection. Taken together, these results provide a statistical view of the limits to population-genetic inference in even the simplest case of a closed population.
从自然种群中获取大量个体的全基因组序列的能力引发了关于最佳抽样设计的问题,并限制了从时间调查数据中提取关键种群遗传参数信息的能力。本文介绍了评估观测到的等位基因频率的时间波动是否与随机遗传漂变假设一致的方法,并给出了相关统计数据的预期抽样方差的表达式,这些表达式涉及样本量和数量。还介绍了选择系数均值和时间方差的估计方法和统计可靠性方面的内容。对于通过中性检验的核苷酸位点,可以通过矩法估计当前有效种群大小,其抽样方差的表达式提供了对在替代抽样方案下该方法能产生有意义结果的程度的深入了解。最后,对使用等位基因频率变化的时间协方差来推断选择提出了一些注意事项。总之,这些结果提供了即使在最简单的封闭种群情况下,种群遗传推断的限制的统计观点。