Wang J
Institute of Zoology, Zoological Society of London, London, NW1 4RY, UK.
Mol Ecol. 2015 Jul;24(14):3546-58. doi: 10.1111/mec.13204. Epub 2015 May 14.
The widely applied genetic differentiation statistics F(ST) and G(ST) have recently been criticized for underestimating differentiation when applied to highly polymorphic markers such as microsatellites. New statistics claimed to be unaffected by marker polymorphisms have been proposed and advocated to replace the traditional F(ST) and G(ST). This study shows that G(ST) gives accurate estimates and underestimates of differentiation when demographic factors are more and less important than mutations, respectively. In the former case, all markers, regardless of diversity (H(S)), have the same G(ST) value in expectation and thus give replicated estimates of differentiation. In the latter case, markers of higher H(S) have lower G(ST) values, resulting in a negative, roughly linear correlation between G(ST) and H(S) across loci. I propose that the correlation coefficient between G(ST) and H(S) across loci, r(GH), can be used to distinguish the two cases and to detect mutational effects on G(ST). A highly negative and significant r(GH), when coupled with highly variable G(ST) values among loci, would reveal that marker G(ST) values are affected substantially by mutations and marker diversity, underestimate population differentiation, and are not comparable among studies, species and markers. Simulated and empirical data sets are used to check the power and statistical behaviour, and to demonstrate the usefulness of the correlation analysis.
广泛应用的遗传分化统计量F(ST)和G(ST)最近受到批评,因为当应用于高度多态性标记(如微卫星)时,它们会低估分化程度。有人提出并主张采用据称不受标记多态性影响的新统计量来取代传统的F(ST)和G(ST)。本研究表明,当人口统计学因素分别比突变重要程度低和高时,G(ST)会分别给出准确的分化估计值和低估的分化估计值。在前一种情况下,所有标记,无论多样性(H(S))如何,预期的G(ST)值都相同,因此能给出重复的分化估计值。在后一种情况下,H(S)较高的标记具有较低的G(ST)值,导致跨位点的G(ST)与H(S)之间呈负的、大致线性的相关性。我提出,跨位点的G(ST)与H(S)之间的相关系数r(GH)可用于区分这两种情况,并检测突变对G(ST)的影响。当高度负且显著的r(GH)与位点间高度可变的G(ST)值相结合时,将表明标记的G(ST)值受到突变和标记多样性的显著影响,低估了种群分化,并且在不同研究、物种和标记之间不可比。使用模拟和实证数据集来检验功效和统计行为,并证明相关分析的有用性。