Genetic algorithm-based maximum-likelihood analysis for molecular phylogeny.

Authors

Katoh K, Kuma K, Miyata T

Affiliation

Department of Biophysics, Graduate School of Science, Kyoto University, Kyoto 606-8502, Japan.

Publication information

J Mol Evol. 2001 Oct-Nov;53(4-5):477-84. doi: 10.1007/s002390010238.

Abstract

A heuristic approach to searching for the maximum-likelihood (ML) phylogenetic tree based on a genetic algorithm (GA) has been developed. It outputs the best tree as well as multiple alternative trees that are not significantly worse than the best one under the likelihood criterion. These near-optimum trees are then subjected to further statistical tests. The approach makes it possible to infer phylogenetic trees of more than 20 taxa, taking rate heterogeneity among sites into account, on practical time scales on a PC cluster. Computer simulations were conducted to compare the efficiency of the present approach with that of several likelihood-based and distance-based methods, using amino acid sequence data for relatively large numbers of taxa (5-24). The superiority of the ML method over distance-based methods increases as the simulation conditions become more realistic (an incorrect model is assumed or many taxa are involved). The approach was applied to inferring the universal tree from the concatenated amino acid sequences of vertically descended genes that are shared among all genomes whose complete sequences have been reported. The inferred tree strongly supports the view that Archaea is paraphyletic and that Eukarya is specifically related to Crenarchaeota. Apart from the paraphyly of Archaea and some minor disagreements, the universal tree based on these genes is largely consistent with the universal tree based on SSU rRNA.
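The general idea of a GA search that reports both the best candidate and a set of near-optimum alternatives can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not describe their tree encoding, genetic operators, or significance threshold. Here candidates are toy bit tuples standing in for tree topologies, `toy_log_likelihood` stands in for a real tree likelihood, and `delta` is an arbitrary cutoff; every name in the block is hypothetical.

```python
import random

def ga_search(init, score, mutate, crossover,
              pop_size=30, generations=60, delta=2.0, seed=0):
    """Toy GA: return (best, near_optimal, best_score).

    near_optimal holds every candidate seen whose score is within
    `delta` of the best score (an assumed, arbitrary cutoff).
    """
    rng = random.Random(seed)
    population = [init(rng) for _ in range(pop_size)]
    seen = {}  # candidate -> score, for every candidate evaluated
    for _ in range(generations):
        for cand in population:
            if cand not in seen:
                seen[cand] = score(cand)
        # Elitist selection: the better-scoring half survives as parents.
        ranked = sorted(population, key=seen.__getitem__, reverse=True)
        parents = ranked[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            children.append(mutate(crossover(a, b, rng), rng))
        population = parents + children
    for cand in population:  # score the final generation too
        if cand not in seen:
            seen[cand] = score(cand)
    best = max(seen, key=seen.get)
    near = sorted(
        (c for c in seen if c != best and seen[best] - seen[c] <= delta),
        key=seen.get, reverse=True,
    )
    return best, near, seen[best]

# --- Toy problem (assumption: bit tuples stand in for tree topologies) ---
TARGET = (1, 0, 1, 1, 0, 0, 1, 0, 1, 1, 0, 1)

def toy_log_likelihood(cand):
    # Stand-in score: 0 at the optimum, minus one per mismatching position.
    return -float(sum(x != t for x, t in zip(cand, TARGET)))

def toy_init(rng):
    return tuple(rng.randint(0, 1) for _ in TARGET)

def toy_mutate(cand, rng):
    i = rng.randrange(len(cand))  # flip one randomly chosen bit
    return cand[:i] + (1 - cand[i],) + cand[i + 1:]

def toy_crossover(a, b, rng):
    cut = rng.randrange(1, len(a))  # one-point crossover
    return a[:cut] + b[cut:]

best, near, best_score = ga_search(
    toy_init, toy_log_likelihood, toy_mutate, toy_crossover
)
print("best score:", best_score, "| near-optimum alternatives:", len(near))
```

In a real analysis the near-optimum set would then go on to the further statistical tests the abstract mentions, rather than being filtered by a fixed score difference.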

