Department of Plant Ecology and Genetics, Uppsala University, SE-752 36 Uppsala, Sweden.
G3 (Bethesda). 2012 Jul;2(7):803-14. doi: 10.1534/g3.112.002410. Epub 2012 Jul 1.
Most species have at least some level of genetic structure. Recent simulation studies have shown that it is important to consider population structure when sampling individuals to infer past population history. The relevance of the results of these computer simulations for empirical studies, however, remains unclear. In the present study, we use DNA sequence datasets collected from two closely related species with very different histories, the selfing species Capsella rubella and its outcrossing relative C. grandiflora, to assess the impact of different sampling strategies on summary statistics and the inference of historical demography. Sampling strategy did not strongly influence the mean values of Tajima's D in either species, but it had some impact on the variance. The general conclusions about demographic history were comparable across sampling schemes even when resampled data were analyzed with approximate Bayesian computation (ABC). We used simulations to explore the effects of sampling scheme under different demographic models. We conclude that when sequences from modest numbers of loci (<60) are analyzed, the sampling strategy is generally of limited importance. The same is true under intermediate or high levels of gene flow (4Nm > 2-10) in models in which global expansion is combined with either local expansion or hierarchical population structure. Although we observe a less severe effect of sampling than predicted under some earlier simulation models, our results should not be seen as an encouragement to neglect this issue. In general, a good coverage of the natural range, both within and between populations, will be needed to obtain a reliable reconstruction of a species's demographic history, and in fact, the effect of sampling scheme on polymorphism patterns may itself provide important information about demographic history.
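For readers unfamiliar with the summary statistic named above, the following is a minimal sketch of how Tajima's D can be computed from a sample of aligned sequences, using the standard constants from Tajima (1989). It is an illustration only and not the authors' pipeline. The function name `tajimas_d` and the input format (a list of equal-length strings with no missing data) are assumptions made for this example.

```python
from collections import Counter
from math import sqrt


def tajimas_d(seqs):
    """Tajima's D for a list of aligned, equal-length sequences.

    Hypothetical helper for illustration: assumes no gaps or missing data.
    Returns None when the sample has no segregating sites.
    """
    n = len(seqs)
    if n < 4:
        # The variance term is zero for n < 4, so D is undefined there.
        raise ValueError("need at least 4 sequences")
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned to equal length")

    n_pairs = n * (n - 1) / 2
    seg_sites = 0   # S: number of segregating sites
    pi = 0.0        # mean number of pairwise differences
    for column in zip(*seqs):
        counts = Counter(column)
        if len(counts) > 1:
            seg_sites += 1
            # Pairs carrying the same allele at this site do not differ.
            same = sum(c * (c - 1) / 2 for c in counts.values())
            pi += (n_pairs - same) / n_pairs

    if seg_sites == 0:
        return None

    # Constants that depend only on the sample size n.
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n**2 + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)

    # Difference between the two estimators of theta, scaled by its
    # estimated standard deviation.
    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return (pi - seg_sites / a1) / sqrt(variance)
```

Applying such a function to subsets of individuals drawn under different schemes (for example, all from one population versus one individual per population) gives one simple way to see how the mean and variance of D respond to sampling.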