Choisy M, Franck P, Cornuet J-M
Centre d'Etude sur le Polymorphisme des Micro-organismes, UMR CNRS-IRD 9926, Montpellier, France.
Mol Ecol. 2004 Apr;13(4):955-68. doi: 10.1111/j.1365-294x.2004.02107.x.
Several methods have been developed to estimate the parental contributions in the genetic pool of an admixed population. Some pair-comparisons have been performed on real data but, to date, no systematic comparison of a large number of methods has been attempted. In this study, we performed a simulated data-based comparison of six of the most cited methods in the literature of the last 20 years. Five of these methods use allele frequencies and differ in the statistical treatment of the data. The last one also considers the degree of molecular divergence by estimating the coalescence times. Comparisons are based on the frequency at which the method can be applied, the bias and the mean square error of the estimation, and the frequency at which the true value is within the confidence interval. Eventually, each method was applied to a real data set of variously introgressed honeybee populations. In optimal conditions (highly differentiated parental populations, recent hybridization event), all methods perform equally well. When conditions are not optimal, the methods perform differently, but no method is always better or worse than all others. Some guidelines are given for the choice of the method.
已经开发了几种方法来估计混合群体基因库中的亲本贡献。已经对实际数据进行了一些配对比较,但迄今为止,尚未尝试对大量方法进行系统比较。在本研究中,我们对过去20年文献中引用最多的六种方法进行了基于模拟数据的比较。其中五种方法使用等位基因频率,并且在数据的统计处理上有所不同。最后一种方法还通过估计合并时间来考虑分子分歧程度。比较基于该方法可应用的频率、估计的偏差和均方误差,以及真实值在置信区间内的频率。最终,每种方法都应用于不同渗入程度的蜜蜂群体的真实数据集。在最佳条件下(高度分化的亲本群体,近期杂交事件),所有方法表现同样良好。当条件不是最佳时,这些方法表现不同,但没有一种方法总是比其他所有方法更好或更差。给出了一些方法选择的指导原则。