Galtier N, Gouy M
Université Claude Bernard Lyon 1, Villeurbanne, France.
Mol Biol Evol. 1998 Jul;15(7):871-9. doi: 10.1093/oxfordjournals.molbev.a025991.
A nonhomogeneous, nonstationary stochastic model of DNA sequence evolution allowing varying equilibrium G + C contents among lineages is devised in order to deal with sequences of unequal base compositions. A maximum-likelihood implementation of this model for phylogenetic analyses allows handling of a reasonable number of sequences. The relevance of the model and the accuracy of parameter estimates are theoretically and empirically assessed, using real or simulated data sets. Overall, a significant amount of information about past evolutionary modes can be extracted from DNA sequences, suggesting that process (rates of distinct kinds of nucleotide substitutions) and pattern (the evolutionary tree) can be simultaneously inferred. G + C contents at ancestral nodes are quite accurately estimated. The new method appears to be useful for phylogenetic reconstruction when base composition varies among compared sequences. It may also be suitable for molecular evolution studies.
为处理碱基组成不均等的序列,设计了一种非齐次、非平稳的DNA序列进化随机模型,该模型允许各谱系间平衡的G + C含量有所不同。此模型用于系统发育分析的最大似然实现方式能够处理数量合理的序列。使用真实或模拟数据集,从理论和实证两方面评估了该模型的相关性以及参数估计的准确性。总体而言,可从DNA序列中提取大量有关过去进化模式的信息,这表明可以同时推断过程(不同类型核苷酸替换的速率)和模式(进化树)。祖先节点处的G + C含量能够得到相当准确的估计。当比较序列间的碱基组成不同时,新方法似乎对系统发育重建很有用。它也可能适用于分子进化研究。