Fitch W M, Yasunobu K T
J Mol Evol. 1975 Jun 9;5(1):1-24. doi: 10.1007/BF01732010.
The common but generally overlooked problem of how best to construct phylogenies from orthologous amino acid sequences, when their alignment requires the placement therein of gaps denoting insertions/deletions in the evolutionary history of their genes since their common ancestor, has been studied. Three diverse methods were examined: 1. each missing residue in a gap is weighted as equivalent to the average number of minimum nucleotide replacements in known conjugate amino acid pairs of those same two sequences, which weight necessarily differs for each pair of sequences; 2. each missing residue in a gap is weighted as equivalent to a fixed number of nucleotide replacements; and 3. each gap, regardless of length, is weighted as equivalent to a fixed number of nucleotide replacements. For the flavodoxins, each method yielded a different best tree and suggests that the choice of method may be crucial. For the plant ferredoxins, all methods give results inconsistent with botanical classification and suggests the sequences may not all be orthologous. For the bacterial ferredoxins, the method was less germane than the actual weight used, five different best trees being obtained depending upon the weight. The best tree for all ferredoxins (prokaryotic plus eukaryotic) combined proved to be greatly dependent upon the gap locations with several reasonable aligments yielding different best trees. They also suggest that functional equivalence may well prove to be a poor guide to which residues have a common ancestral codon. The rubredoxin sequences show that a partial internal gene duplication occurred in the Pseudomonas line, probably very soon after its divergence from the other genera. Together, the results clearly indicate that the phylogenetic answer one gets may greatly depend upon how one treats the gaps but they fail to indicate what treatment may be best.
一个常见但通常被忽视的问题是,当从直系同源氨基酸序列构建系统发育树时,其比对需要在其中放置空位以表示自它们的共同祖先以来其基因进化历史中的插入/缺失,对此已进行了研究。研究了三种不同的方法:1. 空位中的每个缺失残基的权重等同于这两条相同序列中已知共轭氨基酸对的最小核苷酸替换平均数,该权重必然因每对序列而异;2. 空位中的每个缺失残基的权重等同于固定数量的核苷酸替换;3. 每个空位,无论长度如何,权重等同于固定数量的核苷酸替换。对于黄素氧还蛋白,每种方法都产生了不同的最优树,这表明方法的选择可能至关重要。对于植物铁氧还蛋白,所有方法给出的结果都与植物分类不一致,这表明这些序列可能并非全部是直系同源的。对于细菌铁氧还蛋白,该方法不如实际使用的权重相关,根据权重获得了五种不同的最优树。所有铁氧还蛋白(原核生物加真核生物)组合的最优树被证明极大地依赖于空位位置,几种合理的比对产生了不同的最优树。它们还表明,功能等效性很可能被证明是判断哪些残基具有共同祖先密码子的糟糕指南。红氧还蛋白序列表明,在假单胞菌系中发生了部分内部基因复制,可能在其与其他属分化后不久。总之,结果清楚地表明,人们得到的系统发育答案可能很大程度上取决于如何处理空位,但它们未能表明哪种处理可能是最佳的。