Zheng Yuchi, Wiens John J
Department of Herpetology, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu 610041, China; Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721-088, USA.
Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721-088, USA.
Mol Phylogenet Evol. 2016 Jan;94(Pt B):537-547. doi: 10.1016/j.ympev.2015.10.009. Epub 2015 Oct 22.
Two common approaches for estimating phylogenies in species-rich groups are to: (i) sample many loci for few species (e.g. phylogenomic approach), or (ii) sample many species for fewer loci (e.g. supermatrix approach). In theory, these approaches can be combined to simultaneously resolve both higher-level relationships (with many genes) and species-level relationships (with many taxa). However, fundamental questions remain unanswered about this combined approach. First, will higher-level relationships more closely resemble those estimated from many genes or those from many taxa? Second, will branch support increase for higher-level relationships (relative to the estimate from many taxa)? Here, we address these questions in squamate reptiles. We combined two recently published datasets, one based on 44 genes for 161 species, and one based on 12 genes for 4161 species. The likelihood-based tree from the combined matrix (52 genes, 4162 species) shared more higher-level clades with the 44-gene tree (90% vs. 77% shared). Branch support for higher level-relationships was marginally higher than in the 12-gene tree, but lower than in the 44-gene tree. Relationships were apparently not obscured by the abundant missing data (92% overall). We provide a time-calibrated phylogeny based on extensive sampling of genes and taxa as a resource for comparative studies.
(i)对少数物种采样多个基因座(例如系统发育基因组学方法),或(ii)对较少基因座采样多个物种(例如超矩阵方法)。理论上,这些方法可以结合起来,同时解析更高层次的关系(使用多个基因)和物种层次的关系(使用多个分类单元)。然而,关于这种组合方法的一些基本问题仍然没有答案。第一,更高层次的关系会更类似于从多个基因估计的关系还是从多个分类单元估计的关系?第二,相对于从多个分类单元估计的结果,更高层次关系的分支支持度会增加吗?在这里,我们在有鳞目爬行动物中解决这些问题。我们结合了两个最近发表的数据集,一个基于161个物种的44个基因,另一个基于4161个物种的12个基因。合并矩阵(52个基因,4162个物种)的基于似然法的树与44个基因的树共享更多的高层次分支(共享率分别为90%和77%)。更高层次关系的分支支持度略高于12个基因的树,但低于44个基因的树。关系显然没有被大量的缺失数据(总体缺失率92%)所掩盖。我们基于对基因和分类单元的广泛采样提供了一个时间校准的系统发育树,作为比较研究的资源。