Morrison David A
Department of Parasitology (SWEPAR), National Veterinary Institute and Swedish University of Agricultural Sciences, 751 89 Uppsala, Sweden.
Adv Parasitol. 2006;63:1-124. doi: 10.1016/S0065-308X(06)63001-7.
Phylogenetic analysis has changed greatly in the last decade, and the most important themes in that change are reviewed here. Sequence data have become the most common source of phylogenetic information. This means that explicit models for evolutionary processes have been developed in a likelihood context, which allow more realistic data analyses. These models are becoming increasingly complex, both for nucleotides and for amino acid sequences, and so all such models need to be quantitatively assessed for each data set, to find the most appropriate one for use in any particular tree-building analysis. Bayesian analysis has been developed for tree-building and is greatly increasing in popularity. This is because a good heuristic strategy exists, which allows large data sets to be analyzed with complex evolutionary models in a practical time. Perhaps the most disappointing aspect of tree interpretation is the ongoing confusion between rooted and unrooted trees, while the effect of taxon and character sampling is often overlooked when constructing a phylogeny (especially in parasitology). The review finishes with a detailed consideration of the analysis of a multi-gene data set for several dozen taxa of Cryptosporidium (Apicomplexa), illustrating many of the theoretical and practical points highlighted in the review.
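Two points in the abstract, the reliance on heuristic search for large data sets and the distinction between rooted and unrooted trees, can be made concrete by counting tree topologies. The sketch below is illustrative only and is not taken from the review. It uses the standard counts for strictly bifurcating trees with labelled tips, (2n-5)!! unrooted and (2n-3)!! rooted topologies for n taxa. The function names are hypothetical.

```python
def double_factorial(k: int) -> int:
    """Return k * (k - 2) * (k - 4) * ... down to 1 (1 for k <= 1)."""
    result = 1
    while k > 1:
        result *= k
        k -= 2
    return result


def count_unrooted_trees(n_taxa: int) -> int:
    """Number of unrooted bifurcating topologies for n labelled taxa."""
    if n_taxa < 3:
        raise ValueError("need at least 3 taxa for an unrooted tree")
    return double_factorial(2 * n_taxa - 5)


def count_rooted_trees(n_taxa: int) -> int:
    """Number of rooted bifurcating topologies for n labelled taxa."""
    if n_taxa < 2:
        raise ValueError("need at least 2 taxa for a rooted tree")
    return double_factorial(2 * n_taxa - 3)


if __name__ == "__main__":
    for n in (4, 5, 10, 20):
        print(n, count_unrooted_trees(n), count_rooted_trees(n))
```

An unrooted tree with n tips has 2n-3 branches, and placing the root on any one of them gives a different rooted tree, so each unrooted topology corresponds to 2n-3 rooted ones. The counts also grow faster than exponentially with the number of taxa, which is why exhaustive search is impractical for several dozen taxa and heuristic strategies are needed.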