系统发生基因组学中的排列不确定性校正。
Accounting for alignment uncertainty in phylogenomics.
机构信息
Department of Biology, University of Virginia, Charlottesville, Virginia, United States of America.
出版信息
PLoS One. 2012;7(1):e30288. doi: 10.1371/journal.pone.0030288. Epub 2012 Jan 17.
Uncertainty in multiple sequence alignments has a large impact on phylogenetic analyses. Little has been done to evaluate the quality of individual positions in protein sequence alignments, which directly impact the accuracy of phylogenetic trees. Here we describe ZORRO, a probabilistic masking program that accounts for alignment uncertainty by assigning confidence scores to each alignment position. Using the BALIBASE database and in simulation studies, we demonstrate that masking by ZORRO significantly reduces the alignment uncertainty and improves the tree accuracy.
多序列比对中的不确定性对系统发育分析有很大影响。人们很少评估蛋白质序列比对中各个位置的质量,而这些位置直接影响系统发育树的准确性。在这里,我们描述了 ZORRO,这是一个概率掩蔽程序,通过为每个比对位置分配置信分数来考虑比对不确定性。使用 BALIBASE 数据库和模拟研究,我们证明 ZORRO 的掩蔽显著降低了比对的不确定性并提高了树的准确性。