Simmons Mark P, Freudenstein John V
The Ohio State University Herbarium, Department of Evolution, Ecology and Organismal Biology, 1315 Kinnear Road, Columbus, Ohio, 43212.
Cladistics. 2002 Jun;18(3):354-365. doi: 10.1111/j.1096-0031.2002.tb00156.x.
A phylogenetic analysis can be no better than the characters on which it is based. Just as it is inappropriate to code character states of individual characters as separate presence/absence characters, it is inappropriate to combine independent characters because not all information in the data is being utilized. Composite characters link otherwise discernible states from different characters together to form new character states. There are two related problems with this coding. First, there is a loss of hierarchic information between the reductive and composite characters when unordered states are used. Second, the linking of separate characters that occurs during the construction of composite character states can create putative synapomorphies that were not present in the separate characters. For amino acid characters, the problem may occur whenever more than one position of a codon is variable among the terminals sampled. Groups that are resolved as paraphyletic with reductive coding may be resolved as monophyletic with composite coding. The artificial character states indicated by the amino acid characters are unlikely to be congruent with the true gene tree.
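The second problem can be illustrated with a minimal sketch. The five terminals and their codons below are hypothetical, invented only for illustration, and the function names are assumptions rather than anything from the paper. The sketch codes one codon both ways: reductively (one character per nucleotide position) and compositely (one amino acid character). It then lists groups of terminals that share a composite state but do not correspond to any group sharing a state at a single nucleotide position. Shared states are used here as a rough proxy for putative synapomorphies; no rooting or tree search is performed.

```python
from collections import defaultdict

# Only the codons used below (standard genetic code); not a full table.
CODON_TO_AA = {"TTT": "Phe", "TTC": "Phe", "TTA": "Leu", "CTA": "Leu", "CTT": "Leu"}

# Hypothetical terminals and codons, invented for illustration.
TERMINALS = {"t1": "TTT", "t2": "TTC", "t3": "TTA", "t4": "CTA", "t5": "CTT"}


def groups_by_state(states):
    """Partition terminals by state, keeping only groups of two or more."""
    groups = defaultdict(set)
    for terminal, state in states.items():
        groups[state].add(terminal)
    return {s: frozenset(m) for s, m in groups.items() if len(m) > 1}


def reductive_groups(codons):
    """Reductive coding: one character per codon position."""
    return [
        groups_by_state({t: codon[i] for t, codon in codons.items()})
        for i in range(3)
    ]


def composite_groups(codons):
    """Composite coding: one amino acid character for the whole codon."""
    return groups_by_state({t: CODON_TO_AA[c] for t, c in codons.items()})


def novel_composite_groups(codons):
    """Composite groups that match no single-position nucleotide group."""
    seen = {g for position in reductive_groups(codons) for g in position.values()}
    return {aa: g for aa, g in composite_groups(codons).items() if g not in seen}


if __name__ == "__main__":
    for i, position in enumerate(reductive_groups(TERMINALS), start=1):
        print(f"position {i}:", {s: sorted(g) for s, g in position.items()})
    print("amino acid:", {s: sorted(g) for s, g in composite_groups(TERMINALS).items()})
    print("novel:", {s: sorted(g) for s, g in novel_composite_groups(TERMINALS).items()})
```

In this toy data set the first position groups t1, t2, t3 against t4, t5, and the third position groups t1 with t5 and t3 with t4. The amino acid character instead groups t3, t4, t5 (Leu) and t1, t2 (Phe), and neither of those groups is delimited by any single nucleotide position.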