Larridon Isabel, Villaverde Tamara, Zuntini Alexandre R, Pokorny Lisa, Brewer Grace E, Epitawalage Niroshini, Fairlie Isabel, Hahn Marlene, Kim Jan, Maguilla Enrique, Maurin Olivier, Xanthos Martin, Hipp Andrew L, Forest Félix, Baker William J
Royal Botanic Gardens, Kew, Surrey, United Kingdom.
Systematic and Evolutionary Botany Lab, Department of Biology, Ghent University, Ghent, Belgium.
Front Plant Sci. 2020 Jan 9;10:1655. doi: 10.3389/fpls.2019.01655. eCollection 2019.
In phylogenetic studies across angiosperms, at various taxonomic levels, polytomies have persisted despite efforts to resolve them by increasing sampling of taxa and loci. The large amount of genomic data now available and statistical tools to analyze them provide unprecedented power for phylogenetic inference. Targeted sequencing has emerged as a strong tool for estimating species trees in the face of rapid radiations, lineage sorting, and introgression. Evolutionary relationships in Cyperaceae have been studied mostly using Sanger sequencing until recently. Despite ample taxon sampling, relationships in many genera remain poorly understood, hampered by diversification rates that outpace mutation rates in the loci used. The C4 clade of the genus has been particularly difficult to resolve. Previous studies based on a limited set of markers resolved relationships among species using the C3 photosynthetic pathway, but not among C4 clade taxa. We test the ability of two targeted sequencing kits to resolve relationships in the C4 clade, the universal Angiosperms-353 kit and a Cyperaceae-specific kit. Sequences of the targeted loci were recovered from data generated with both kits and used to investigate overlap in data between kits and relative efficiency of the general and custom approaches. The power to resolve shallow-level relationships was tested using a summary species tree method and a concatenated maximum likelihood approach. High resolution and support are obtained using both approaches, but high levels of missing data disproportionately impact the latter. Targeted sequencing provides new insights into the evolution of morphology in the C4 clade, demonstrating for example that the former segregate genus is polyphyletic despite its seeming morphological integrity. An unexpected result is that the - complex comprises a clade separate from and sister to the core C4 clade. Our results demonstrate that data generated with a family-specific kit do not necessarily have more power than those obtained with a universal kit, but that data generated with different targeted sequencing kits can often be merged for downstream analyses. Moreover, our study contributes to the growing consensus that targeted sequencing data are a powerful tool in resolving rapid radiations.
在被子植物的系统发育研究中,在各个分类水平上,尽管通过增加分类群和基因座的采样来解决多歧分类,但多歧分类仍然存在。现在可用的大量基因组数据以及分析这些数据的统计工具为系统发育推断提供了前所未有的能力。面对快速辐射、谱系分选和基因渗入,靶向测序已成为估计物种树的强大工具。直到最近,莎草科的进化关系大多使用桑格测序法进行研究。尽管分类群采样充足,但许多属的关系仍了解甚少,这受到多样化速率超过所用基因座突变速率的阻碍。该属的C4进化枝尤其难以解析。以前基于有限一组标记的研究解决了使用C3光合途径的物种之间的关系,但未解决C4进化枝分类群之间的关系。我们测试了两种靶向测序试剂盒解析C4进化枝关系的能力,即通用的被子植物-353试剂盒和特定于莎草科的试剂盒。从两种试剂盒生成的数据中恢复了靶向基因座的序列,并用于研究试剂盒之间的数据重叠以及通用方法和定制方法的相对效率。使用总结物种树方法和串联最大似然方法测试了解析浅层关系的能力。两种方法都获得了高分辨率和支持,但高水平的缺失数据对后者的影响尤为严重。靶向测序为C4进化枝的形态进化提供了新的见解,例如表明以前分离的属尽管形态上看似完整,但却是多系的。一个意外的结果是,-复合体包含一个与核心C4进化枝分开且为其姐妹的进化枝。我们的结果表明,使用特定于科的试剂盒生成的数据不一定比使用通用试剂盒获得的数据更有能力,但使用不同靶向测序试剂盒生成的数据通常可以合并用于下游分析。此外,我们的研究有助于越来越多的人达成共识,即靶向测序数据是解决快速辐射的有力工具。