Wang Yuyu, Zhou Xiaofan, Yang Ding, Rokas Antonis
Department of Entomology, China Agricultural University, Beijing, China Department of Biological Sciences, Vanderbilt University.
Department of Biological Sciences, Vanderbilt University.
Genome Biol Evol. 2015 Nov 25;7(12):3463-71. doi: 10.1093/gbe/evv235.
Comparison of individual gene trees in several recent phylogenomic studies from diverse lineages has revealed a surprising amount of topological conflict or incongruence, but we still know relatively little about its distribution across the tree of life. To further our understanding of incongruence, the factors that contribute to it and how it can be ameliorated, we examined its distribution in a clade of 20 Culicidae mosquito species through the reconstruction and analysis of the phylogenetic histories of 2,007 groups of orthologous genes. Levels of incongruence were generally low, the three exceptions being the internodes concerned with the branching of Anopheles christyi, with the branching of the subgenus Anopheles as well as the already reported incongruence within the Anopheles gambiae species complex. Two of these incongruence events (A. gambiae species complex and A. christyi) are likely due to biological factors, whereas the third (subgenus Anopheles) is likely due to analytical factors. Similar to previous studies, the use of genes or internodes with high bootstrap support or internode certainty values, both of which were positively correlated with gene alignment length, substantially reduced the observed incongruence. However, the clade support values of the internodes concerned with the branching of the subgenus Anopheles as well as within the A. gambiae species complex remained very low. Based on these results, we infer that the prevalence of incongruence in Culicidae mosquitoes is generally low, that it likely stems from both analytical and biological factors, and that it can be ameliorated through the selection of genes with strong phylogenetic signal. More generally, selection of genes with strong phylogenetic signal may be a general empirical solution for reducing incongruence and increasing the robustness of inference in phylogenomic studies.
在最近几项针对不同谱系的系统发育基因组学研究中,对单个基因树的比较揭示了数量惊人的拓扑冲突或不一致性,但我们对其在生命之树上的分布仍然知之甚少。为了进一步了解不一致性、导致不一致性的因素以及如何改善这种情况,我们通过重建和分析2007组直系同源基因的系统发育历史,研究了其在20种蚊科蚊子分支中的分布情况。不一致性水平总体较低,三个例外情况分别是与克氏按蚊分支相关的节间、按蚊亚属的分支以及冈比亚按蚊物种复合体中已报道的不一致性。其中两个不一致性事件(冈比亚按蚊物种复合体和克氏按蚊)可能是由生物学因素导致的,而第三个(按蚊亚属)可能是由分析因素导致的。与之前的研究类似,使用具有高自展支持度或节间确定性值的基因或节间(这两者都与基因比对长度呈正相关),显著降低了观察到的不一致性。然而,与按蚊亚属分支相关的节间以及冈比亚按蚊物种复合体内的分支支持值仍然非常低。基于这些结果,我们推断蚊科蚊子中不一致性的普遍程度通常较低,它可能源于分析因素和生物学因素,并且可以通过选择具有强系统发育信号的基因来改善。更一般地说,选择具有强系统发育信号的基因可能是减少不一致性并提高系统发育基因组学研究推断稳健性的一个通用经验性解决方案。