Department of Biology, Stanford University, Stanford, CA 94305, USA.
G3 (Bethesda). 2022 Sep 30;12(10). doi: 10.1093/g3journal/jkac205.
Properties of gene genealogies such as tree height (H), total branch length (L), total lengths of external (E) and internal (I) branches, mean length of basal branches (B), and the underlying coalescence times (T) can be used to study population-genetic processes and to develop statistical tests of population-genetic models. Uses of tree features in statistical tests often rely on predictions that depend on pairwise relationships among such features. For genealogies under the coalescent, we provide exact expressions for Taylor approximations to expected values and variances of ratios Xn/Yn, for all 15 pairs among the variables {Hn,Ln,En,In,Bn,Tk}, considering n leaves and 2≤k≤n. For expected values of the ratios, the approximations match closely with empirical simulation-based values. The approximations to the variances are not as accurate, but they generally match simulations in their trends as n increases. Although En has expectation 2 and Hn has expectation 2 in the limit as n→∞, the approximation to the limiting expectation for En/Hn is not 1, instead equaling π2/3-2≈1.28987. The new approximations augment fundamental results in coalescent theory on the shapes of genealogical trees.
基因谱系的性质,如树高(H)、总分支长度(L)、外部(E)和内部(I)分支的总长度、基部分支的平均长度(B)和潜在的合并时间(T),可用于研究种群遗传过程,并开发种群遗传模型的统计检验。在统计检验中,树特征的使用通常依赖于依赖于这些特征之间的成对关系的预测。对于合并模型下的谱系,我们提供了精确的泰勒近似表达式,用于期望和方差的比值 Xn/Yn,对于变量 {Hn,Ln,En,In,Bn,Tk} 中的所有 15 对,考虑到 n 个叶子和 2≤k≤n。对于比值的期望,近似值与基于经验模拟的值非常吻合。方差的近似值不太准确,但随着 n 的增加,它们通常与模拟趋势相匹配。尽管 En 的期望是 2,Hn 的期望是 2 在 n→∞的极限中,En/Hn 的极限期望的近似值不是 1,而是等于 π2/3-2≈1.28987。新的近似值增加了合并理论中关于谱系树形状的基本结果。