Wang J
Institute of Zoology, Zoological Society of London, London, UK.
J Evol Biol. 2014 Mar;27(3):518-30. doi: 10.1111/jeb.12315. Epub 2014 Jan 21.
The inbreeding coefficient (F) of individuals and the relatedness (r) between individuals are now routinely calculated from marker data in quantitative genetics, conservation genetics, forensics, evolution and ecology. Although both can be defined either as a correlation coefficient or as a probability of identity by descent (IBD) relative to a reference, they are better interpreted as correlations in marker-based analyses, because in practice the reference is frequently the current sample or population whose F and r are being estimated. In such situations, negative estimates have a biological meaning, a substantial proportion of the estimates are expected to be negative, and the average estimates are close to zero for r and equivalent to FIS for F. I show that although current r estimators were developed from the IBD-based concept of relatedness, some of them conform to the correlation-based concept of relatedness and some do not. The latter estimators can be modified, however, so that they estimate r as a correlation coefficient. I also show that F and r estimates can be misleading, and can become biased and marker dependent, when a sample containing a high proportion of highly inbred and/or closely related individuals is used as the reference. In analyses that depend on comparing r (or F) estimates with a priori values expected under ideal conditions (e.g. for identifying genealogical relationships), the estimators should be used with caution.
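The point that estimates centre near zero and are often negative when the current sample serves as the reference can be illustrated with a small simulation. The sketch below is only an illustration under stated assumptions: it uses a generic ratio-of-sums, correlation-style estimator on biallelic allele dosages, which is not necessarily any of the estimators examined in the paper, and the function names, allele-frequency range and sample sizes are hypothetical choices made for the example.

```python
import random


def simulate_genotypes(n_individuals, n_loci, seed=0):
    """Simulate unrelated, non-inbred individuals at biallelic loci.

    Returns one list per individual holding allele dosages (0, 1 or 2).
    The allele-frequency range 0.1-0.9 is an arbitrary assumption.
    """
    rng = random.Random(seed)
    freqs = [rng.uniform(0.1, 0.9) for _ in range(n_loci)]
    return [
        [int(rng.random() < p) + int(rng.random() < p) for p in freqs]
        for _ in range(n_individuals)
    ]


def pairwise_relatedness(genotypes):
    """Correlation-style relatedness for every pair of individuals.

    Allele frequencies are estimated from the sample itself, so the
    sample acts as its own reference (illustrative estimator only).
    """
    n = len(genotypes)
    n_loci = len(genotypes[0])
    # Reference allele frequencies taken from the current sample.
    p = [sum(g[k] for g in genotypes) / (2 * n) for k in range(n_loci)]
    # Summed expected dosage variance across loci.
    denom = sum(2 * q * (1 - q) for q in p)
    if denom == 0:
        raise ValueError("all loci are monomorphic in the sample")
    # Dosages centred on the sample mean at each locus.
    centred = [[g[k] - 2 * p[k] for k in range(n_loci)] for g in genotypes]
    estimates = {}
    for i in range(n):
        for j in range(i + 1, n):
            num = sum(a * b for a, b in zip(centred[i], centred[j]))
            estimates[(i, j)] = num / denom
    return estimates


genotypes = simulate_genotypes(n_individuals=50, n_loci=200)
values = list(pairwise_relatedness(genotypes).values())
mean_r = sum(values) / len(values)
share_negative = sum(v < 0 for v in values) / len(values)
print(f"mean r = {mean_r:.4f}, share negative = {share_negative:.2f}")
```

Because the centred dosages sum to zero at every locus, the average over pairs is pulled slightly below zero, and a large share of the individual pairwise estimates are negative even though no simulated pair is related. This is consistent with reading r as a correlation relative to the sample rather than as an IBD probability, which cannot be negative.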