Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland.
Mol Biol Evol. 2013 Feb;30(2):457-68. doi: 10.1093/molbev/mss227. Epub 2012 Sep 19.
Reconstruction of the past is an important task of evolutionary biology. It takes place at different points in a hierarchy of molecular variation, including genes, individuals, populations, and species. Statistical inference about population histories has recently received considerable attention, following the development of computational tools to provide tractable approaches to this very challenging problem. Here, we introduce a likelihood-based approach which generalizes a recently developed model for random fluctuations in allele frequencies based on an approximation to the neutral Wright-Fisher diffusion. Our new framework approximates the infinite alleles Wright-Fisher model and uses an implementation with an adaptive Markov chain Monte Carlo algorithm. The method is especially well suited to data sets harboring large population samples and relatively few loci for which other likelihood-based models are currently computationally intractable. Using our model, we reconstruct the global population history of a major human pathogen, Streptococcus pneumoniae. The results illustrate the potential to reach important biological insights to an evolutionary process by a population genetics approach, which can appropriately accommodate very large population samples.
重建过去是进化生物学的一项重要任务。它发生在分子变异的不同层次上,包括基因、个体、种群和物种。最近,随着计算工具的发展,为解决这一极具挑战性的问题提供了可行的方法,种群历史的统计推断受到了相当大的关注。在这里,我们引入了一种基于似然的方法,该方法基于对中性 Wright-Fisher 扩散的近似,推广了最近开发的用于等位基因频率随机波动的模型。我们的新框架近似无限等位基因 Wright-Fisher 模型,并使用具有自适应马尔可夫链蒙特卡罗算法的实现。该方法特别适合于包含大量种群样本和相对较少基因座的数据集,而目前其他基于似然的模型在计算上是不可行的。使用我们的模型,我们重建了主要人类病原体肺炎链球菌的全球种群历史。结果说明了通过群体遗传学方法达到对进化过程的重要生物学见解的潜力,该方法可以适当地包含非常大的种群样本。