Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA.
Computational Biology and Bioinformatics Unit, Research School of Biology, The Australian National University, Canberra, Australia.
Mol Biol Evol. 2019 Apr 1;36(4):643-649. doi: 10.1093/molbev/msz013.
Resolving the relationships of animals (Metazoa) is crucial to our understanding of the origin of key traits such as muscles, guts, and nerves. However, a broadly accepted metazoan consensus phylogeny has yet to emerge. In part, this is because the genomes of deeply diverging and fast-evolving lineages may undergo significant gene turnover, reducing the number of orthologs shared with related phyla. This can limit the usefulness of traditional phylogenetic methods that rely on alignments of orthologous sequences. Phylogenetic analysis of gene content has the potential to circumvent this orthology requirement, with binary presence/absence of homologous gene families representing a source of phylogenetically informative characters. Applying binary substitution models to the gene content of 26 complete animal genomes, we demonstrate that patterns of gene conservation differ markedly depending on whether gene families are defined by orthology or homology, that is, whether paralogs are excluded or included. We conclude that the placement of some deeply diverging lineages may exceed the limit of resolution afforded by the current methods based on comparisons of orthologous protein sequences, and novel approaches are required to fully capture the evolutionary signal from genes within genomes.
解析动物(后生动物)之间的关系对我们理解肌肉、肠道和神经等关键特征的起源至关重要。然而,一个被广泛接受的后生动物共识系统发育尚未出现。部分原因是,深度分化和快速进化谱系的基因组可能经历了大量的基因更替,与相关门共享的同源基因数量减少。这限制了传统的基于同源序列比对的系统发育方法的有效性。基于基因内容的系统发育分析有可能规避这种同源性要求,同源基因家族的二元存在/缺失代表了具有系统发育信息量的特征来源。我们应用二进制替换模型对 26 个完整动物基因组的基因内容进行分析,结果表明,基因保守模式因基因家族是通过同源性还是同线性定义(即是否排除或包含旁系同源物)而有显著差异。我们得出结论,一些深度分化的谱系的位置可能超出了基于同源蛋白序列比较的当前方法所提供的分辨率限制,需要新的方法来充分从基因组内的基因中捕获进化信号。