Merle C, Leblois R, Rousset F, Pudlo P
Institut Montpelliérain Alexander Grothendieck UMR CNRS 5149, Université de Montpellier, Place Eugène Bataillon, 34095 Montpellier, France; Centre de Biologie pour la Gestion des Populations, Inra, CBGP (UMR INRA / IRD / Cirad / Montpellier SupAgro), Campus International de Baillarguet, 34988 Montferrier sur Lez, France; Institut de Biologie Computationnelle, Université de Montpellier, 95 rue de la Galera, 34095 Montpellier, France.
Centre de Biologie pour la Gestion des Populations, Inra, CBGP (UMR INRA / IRD / Cirad / Montpellier SupAgro), Campus International de Baillarguet, 34988 Montferrier sur Lez, France; Institut de Biologie Computationnelle, Université de Montpellier, 95 rue de la Galera, 34095 Montpellier, France.
Theor Popul Biol. 2017 Apr;114:70-87. doi: 10.1016/j.tpb.2016.09.002. Epub 2016 Oct 3.
Sequential importance sampling algorithms have been defined to estimate likelihoods in models of ancestral population processes. However, these algorithms are based on features of the models with constant population size, and become inefficient when the population size varies in time, making likelihood-based inferences difficult in many demographic situations. In this work, we modify a previous sequential importance sampling algorithm to improve the efficiency of the likelihood estimation. Our procedure is still based on features of the model with constant size, but uses a resampling technique with a new resampling probability distribution depending on the pairwise composite likelihood. We tested our algorithm, called sequential importance sampling with resampling (SISR) on simulated data sets under different demographic cases. In most cases, we divided the computational cost by two for the same accuracy of inference, in some cases even by one hundred. This study provides the first assessment of the impact of such resampling techniques on parameter inference using sequential importance sampling, and extends the range of situations where likelihood inferences can be easily performed.
顺序重要性抽样算法已被定义用于估计祖先群体过程模型中的似然性。然而,这些算法基于群体大小恒定的模型特征,当群体大小随时间变化时效率会变低,这使得在许多人口统计学情形下基于似然性的推断变得困难。在这项工作中,我们修改了之前的顺序重要性抽样算法以提高似然性估计的效率。我们的方法仍然基于群体大小恒定的模型特征,但使用了一种重采样技术,该技术具有取决于成对复合似然性的新重采样概率分布。我们在不同人口统计学情形下的模拟数据集上测试了我们称为带重采样的顺序重要性抽样(SISR)的算法。在大多数情况下,对于相同的推断精度,我们将计算成本减半,在某些情况下甚至减为百分之一。本研究首次评估了此类重采样技术对使用顺序重要性抽样进行参数推断的影响,并扩展了可以轻松进行似然性推断的情形范围。