Department of Mathematics and Statistics, University of Otago, Dunedin, New Zealand.
Genome Biol Evol. 2011;3:571-87. doi: 10.1093/gbe/evr050. Epub 2011 Jun 28.
Congruence is a broadly applied notion in evolutionary biology used to justify multigene phylogeny or phylogenomics, as well as in studies of coevolution, lateral gene transfer, and as evidence for common descent. Existing methods for identifying incongruence or heterogeneity using character data were designed for data sets that are both small and expected to be rarely incongruent. At the same time, methods that assess incongruence using comparison of trees test a null hypothesis of uncorrelated tree structures, which may be inappropriate for phylogenomic studies. As such, they are ill-suited for the growing number of available genome sequences, most of which are from prokaryotes and viruses, either for phylogenomic analysis or for studies of the evolutionary forces and events that have shaped these genomes. Specifically, many existing methods scale poorly with large numbers of genes, cannot accommodate high levels of incongruence, and do not adequately model patterns of missing taxa for different markers. We propose the development of novel incongruence assessment methods suitable for the analysis of the molecular evolution of the vast majority of life and support the investigation of homogeneity of evolutionary process in cases where markers do not share identical tree structures.
一致是进化生物学中广泛应用的概念,用于证明多基因系统发育或系统发生基因组学,以及协同进化、横向基因转移的研究,并作为共同进化的证据。现有的使用特征数据识别不一致或异质性的方法是为小数据集设计的,并且预计这些数据集很少不一致。同时,使用树比较评估不一致性的方法检验了树结构不相关的零假设,这对于系统发生基因组学研究可能不合适。因此,它们不适合越来越多的可用基因组序列,其中大多数来自原核生物和病毒,无论是进行系统发生基因组学分析还是研究塑造这些基因组的进化力量和事件。具体来说,许多现有的方法在处理大量基因时效果不佳,无法容纳高水平的不一致性,并且不能充分模拟不同标记缺失分类群的模式。我们建议开发新的不一致评估方法,以适用于分析绝大多数生命的分子进化,并支持在标记不具有相同树结构的情况下调查进化过程的同质性。