Li W H
Genetics. 1986 May;113(1):187-213. doi: 10.1093/genetics/113.1.187.
Mathematical formulas are developed for the evolutionary change of restriction cleavage sites in a DNA sequence, allowing unequal rates between transitional and transversional types of nucleotide substitution. Formulas are also developed for the probability of having a particular pattern of site changes among evolutionary lineages, such as parallel gains or losses of sites, and for inferring the presence or absence of a restriction site in an ancestral sequence from data on the present-day sequences. The unordered compatibility method is proposed for inferring the phylogenetic relationships among relatively closely related organisms, treating restriction sites as cladistic characters. Formulas are derived for the probability (P+) of obtaining the correct network for a given number (N) of informative sites for the cases of four and five species. These formulas are applied to evaluate the performance of the method and to estimate the N value required for P+ to be 95% or larger. The method performs well when the branches between ancestral nodes and the branches leading to the two most recent species are more or less equal in length, but performs poorly when the latter two branches are considerably longer than the former.
本文推导了DNA序列中限制性酶切位点进化变化的数学公式,该公式考虑了核苷酸替换的转换型和颠换型之间的不等速率。同时还推导了进化谱系中出现特定位点变化模式(如位点的平行获得或丢失)的概率公式,以及根据现今序列数据推断祖先序列中限制性酶切位点存在与否的公式。本文提出了无序兼容性方法,将限制性酶切位点视为分支分类特征,用于推断亲缘关系相对较近的生物体之间的系统发育关系。推导了在四种和五种物种的情况下,对于给定数量(N)的信息位点获得正确网络的概率(P+)的公式。这些公式用于评估该方法的性能,并估计使P+达到95%或更高所需的N值。当祖先节点之间的分支以及通向两个最近物种的分支长度大致相等时,该方法表现良好,但当后两个分支比前两个分支长得多时,该方法表现较差。