Volz Erik M, Frost Simon D W
J R Soc Interface. 2014 Dec 6;11(101):20140945. doi: 10.1098/rsif.2014.0945.
Many population genetic models have been developed for the purpose of inferring population size and growth rates from random samples of genetic data. We examine two popular approaches to this problem, the coalescent and the birth–death-sampling model (BDM), in the context of estimating population size and birth rates in a population growing exponentially according to the birth–death branching process. For sequences sampled at a single time, we found the coalescent and the BDM gave virtually indistinguishable results in terms of the growth rates and fraction of the population sampled, even when sampling from a small population. For sequences sampled at multiple time points, we find that the birth–death model estimators are subject to large bias if the sampling process is misspecified. Since BDMs incorporate a model of the sampling process, we show how much of the statistical power of BDMs arises from the sequence of sample times and not from the genealogical tree. This motivates the development of a new coalescent estimator, which is augmented with a model of the known sampling process and is potentially more precise than the coalescent that does not use sample time information.
为了从遗传数据的随机样本中推断种群大小和增长率,人们开发了许多种群遗传模型。我们在根据生死分支过程呈指数增长的种群中,研究了两种解决此问题的常用方法,即合并法和生死抽样模型(BDM),以估计种群大小和出生率。对于在单个时间点采样的序列,我们发现,即使是从小种群中采样,合并法和BDM在增长率和采样种群比例方面给出的结果几乎没有区别。对于在多个时间点采样的序列,我们发现,如果采样过程指定错误,生死模型估计量会有很大偏差。由于BDM包含了采样过程的模型,我们展示了BDM的统计能力有多少来自采样时间序列,而不是系谱树。这促使我们开发一种新的合并估计量,它通过已知采样过程的模型进行增强,可能比不使用采样时间信息的合并法更精确。