Ogden T Heath, Rosenberg Michael S
Department of Biological Sciences, Idaho State University, Idaho 83209, USA.
Syst Biol. 2007 Apr;56(2):182-93. doi: 10.1080/10635150701281102.
Direct optimization frameworks for simultaneously estimating alignments and phylogenies have recently been developed. One such method, implemented in the program POY, is becoming more common for analyses of variable length sequences (e.g., analyses using ribosomal genes) and for combined evidence analyses (morphology + multiple genes). Simulation of sequences containing insertion and deletion events was performed in order to directly compare a widely used method of multiple sequence alignment (ClustalW) and subsequent parsimony analysis in PAUP* with direct optimization via POY. Data sets were simulated for pectinate, balanced, and random tree shapes under different conditions (clocklike, non-clocklike, and ultrametric). Alignment accuracy scores for the implied alignments from POY and the multiple sequence alignments from ClustalW were calculated and compared. In almost all cases (99.95%), ClustalW produced more accurate alignments than POY-implied alignments, judged by the proportion of correctly identified homologous sites. Topological accuracy (distance to the true tree) for POY topologies and topologies generated under parsimony in PAUP* from the ClustalW alignments were also compared. In 44.94% of the cases, Clustal alignment tree reconstructions via PAUP* were more accurate than POY, whereas in 16.71% of the cases POY reconstructions were more topologically accurate (38.38% of the time they were equally accurate). Comparisons between POY hypothesized alignments and the true alignments indicated that, on average, as alignment error increased, topological accuracy decreased.
最近已经开发出用于同时估计比对和系统发育的直接优化框架。其中一种方法在程序POY中实现,对于可变长度序列的分析(例如,使用核糖体基因的分析)以及组合证据分析(形态学+多个基因)变得越来越普遍。为了直接比较广泛使用的多序列比对方法(ClustalW)以及随后在PAUP中进行的简约分析与通过POY进行的直接优化,对包含插入和缺失事件的序列进行了模拟。在不同条件(钟形、非钟形和超度量)下针对栉状、平衡和随机树形模拟了数据集。计算并比较了POY隐含比对和ClustalW多序列比对的比对准确性得分。几乎在所有情况下(99.95%),根据正确识别的同源位点比例判断,ClustalW产生的比对比POY隐含比对更准确。还比较了POY拓扑结构和通过PAUP中的简约法从ClustalW比对生成的拓扑结构的拓扑准确性(与真实树的距离)。在44.94%的情况下,通过PAUP*进行的Clustal比对树重建比POY更准确,而在16.71%的情况下POY重建在拓扑上更准确(38.38%的时间它们同样准确)。POY假设比对与真实比对之间的比较表明,平均而言,随着比对误差增加,拓扑准确性降低。