Suppr超能文献

基于限制性数据构建系统发育树的最大简约法和距离矩阵法的相对效率

Relative efficiencies of the maximum-parsimony and distance-matrix methods of phylogeny construction for restriction data.

作者信息

Lin J, Nei M

机构信息

Center for Demographic and Population Genetics, University of Texas Health Science Center, Houston.

出版信息

Mol Biol Evol. 1991 May;8(3):356-65. doi: 10.1093/oxfordjournals.molbev.a040648.

Abstract

The relative efficiencies of the maximum-parsimony (MP), UPGMA, and neighbor-joining (NJ) methods in obtaining the correct tree (topology) for restriction-site and restriction-fragment data were studied by computer simulation. In this simulation, six DNA sequences of 16,000 nucleotides were assumed to evolve following a given model tree. The recognition sequences of 20 different six-base restriction enzymes were used to identify the restriction sites of the DNA sequences generated. The restriction-site data and restriction-fragment data thus obtained were used to reconstruct a phylogenetic tree, and the tree obtained was compared with the model tree. This process was repeated 300 times. The results obtained indicate that when the rate of nucleotide substitution is constant the probability of obtaining the correct tree (Pc) is generally higher in the NJ method than in the MP method. However, if we use the average topological deviation from the model tree (dT) as the criterion of comparison, the NJ and MP methods are nearly equally efficient. When the rate of nucleotide substitution varies with evolutionary lineage, the NJ method is better than the MP method, whether Pc or dT is used as the criterion of comparison. With 500 nucleotides and when the number of nucleotide substitutions per site was very small, restriction-site data were, contrary to our expectation, more useful than sequence data. Restriction-fragment data were less useful than restriction-site data, except when the sequence divergence was very small. UPGMA seems to be useful only when the rate of nucleotide substitution is constant and sequence divergence is high.

摘要

通过计算机模拟研究了最大简约法(MP)、UPGMA和邻接法(NJ)在根据限制性酶切位点和限制性片段数据获得正确系统发育树(拓扑结构)方面的相对效率。在该模拟中,假设6条长度为16,000个核苷酸的DNA序列按照给定的模型树进化。使用20种不同的六碱基限制性内切酶的识别序列来确定所生成DNA序列的限制性酶切位点。将由此获得的限制性酶切位点数据和限制性片段数据用于重建系统发育树,并将得到的树与模型树进行比较。此过程重复300次。所得结果表明,当核苷酸替换率恒定时,NJ法获得正确树的概率(Pc)通常高于MP法。然而,如果以与模型树的平均拓扑偏差(dT)作为比较标准,NJ法和MP法的效率几乎相同。当核苷酸替换率随进化谱系变化时,无论以Pc还是dT作为比较标准,NJ法都优于MP法。对于500个核苷酸且每个位点的核苷酸替换数非常小时,与预期相反,限制性酶切位点数据比序列数据更有用。除了序列差异非常小时,限制性片段数据比限制性酶切位点数据用处小。UPGMA似乎仅在核苷酸替换率恒定且序列差异较大时有用。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验