Wang Jinliang
Institute of Zoology, Zoological Society of London, Regent's Park, London NW1 4RY, UK.
Genet Res. 2007 Jun;89(3):135-53. doi: 10.1017/S0016672307008798.
Knowledge of the genetic relatedness among individuals is essential in diverse research areas such as behavioural ecology, conservation biology, quantitative genetics and forensics. How to estimate relatedness accurately from genetic marker information has been explored recently by many methodological studies. In this investigation I propose a new likelihood method that uses the genotypes of a triad of individuals in estimating pairwise relatedness (r). The idea is to use a third individual as a control (reference) in estimating the r between two other individuals, thus reducing the chance of genes identical in state being mistakenly inferred as identical by descent. The new method allows for inbreeding and accounts for genotype errors in data. Analyses of both simulated and human microsatellite and SNP datasets show that the quality of r estimates (measured by the root mean squared error, RMSE) is generally improved substantially by the new triadic likelihood method (TL) over the dyadic likelihood method and five moment estimators. Simulations also show that genotyping errors/mutations, when ignored, result in underestimates of r for related dyads, and that incorporating a model of typing errors in the TL method improves r estimates for highly related dyads but impairs those for loosely related or unrelated dyads. The effects of inbreeding were also investigated through simulations. It is concluded that, because most dyads in a natural population are unrelated or only loosely related, the overall performance of the new triadic likelihood method is the best, offering r estimates with a RMSE that is substantially smaller than the five commonly used moment estimators and the dyadic likelihood method.
了解个体间的遗传相关性在行为生态学、保护生物学、数量遗传学和法医学等多个研究领域至关重要。近期,许多方法学研究探讨了如何根据遗传标记信息准确估计相关性。在本研究中,我提出了一种新的似然方法,该方法利用三人组合个体的基因型来估计成对相关性(r)。其思路是在估计另外两个个体之间的r时,将第三个个体用作对照(参考),从而减少状态相同的基因被错误推断为同源相同基因的可能性。新方法考虑了近亲繁殖,并对数据中的基因型错误进行了处理。对模拟的以及人类微卫星和单核苷酸多态性数据集的分析表明,与二元似然方法和五个矩估计器相比,新的三元似然方法(TL)通常能显著提高r估计值的质量(以均方根误差RMSE衡量)。模拟还表明,若忽略基因分型错误/突变,会导致相关二元组的r估计值偏低,并且在TL方法中纳入分型错误模型可改善高度相关二元组的r估计值,但会损害低度相关或不相关二元组的r估计值。还通过模拟研究了近亲繁殖的影响。得出的结论是,由于自然种群中的大多数二元组不相关或仅低度相关,新的三元似然方法的总体性能最佳,其提供的r估计值的RMSE明显小于五种常用的矩估计器和二元似然方法。