Sousa Vitor C, Fritz Marielle, Beaumont Mark A, Chikhi Lounès
Instituto Gulbenkian de Ciência, Rua da Quinta Grande, Oeiras, Portugal.
Genetics. 2009 Apr;181(4):1507-19. doi: 10.1534/genetics.108.098129. Epub 2009 Feb 2.
In recent years approximate Bayesian computation (ABC) methods have become popular in population genetics as an alternative to full-likelihood methods to make inferences under complex demographic models. Most ABC methods rely on the choice of a set of summary statistics to extract information from the data. In this article we tested the use of the full allelic distribution directly in an ABC framework. Although the ABC techniques are becoming more widely used, there is still uncertainty over how they perform in comparison with full-likelihood methods. We thus conducted a simulation study and provide a detailed examination of ABC in comparison with full likelihood in the case of a model of admixture. This model assumes that two parental populations mixed at a certain time in the past, creating a hybrid population, and that the three populations then evolve under pure drift. Several aspects of ABC methodology were investigated, such as the effect of the distance metric chosen to measure the similarity between simulated and observed data sets. Results show that in general ABC provides good approximations to the posterior distributions obtained with the full-likelihood method. This suggests that it is possible to apply ABC using allele frequencies to make inferences in cases where it is difficult to select a set of suitable summary statistics and when the complexity of the model or the size of the data set makes it computationally prohibitive to use full-likelihood methods.
近年来,近似贝叶斯计算(ABC)方法在群体遗传学中变得流行起来,成为在复杂人口模型下进行推断的全似然方法的替代方法。大多数ABC方法依赖于选择一组汇总统计量来从数据中提取信息。在本文中,我们测试了直接在ABC框架中使用完整等位基因分布的情况。尽管ABC技术的使用越来越广泛,但与全似然方法相比,它们的性能仍存在不确定性。因此,我们进行了一项模拟研究,并在混合模型的情况下,对ABC与全似然进行了详细比较。该模型假设两个亲本群体在过去的某个时间混合,形成一个杂交群体,然后这三个群体在纯漂变下进化。我们研究了ABC方法的几个方面,例如选择用来衡量模拟数据集和观测数据集之间相似性的距离度量的影响。结果表明,一般来说,ABC能很好地近似用全似然方法获得的后验分布。这表明,在难以选择一组合适的汇总统计量的情况下,以及当模型的复杂性或数据集的大小使得使用全似然方法在计算上令人望而却步时,可以应用基于等位基因频率的ABC进行推断。