Xiao Jiamin, Lyu Rudan, He Jian, Li Mingyang, Ji Jiaxin, Cheng Jin, Xie Lei
School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China.
College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China.
Front Plant Sci. 2022 Nov 14;13:1059379. doi: 10.3389/fpls.2022.1059379. eCollection 2022.
is one of the largest genera of Ranunculaceae with many phylogenetic problems left to be resolved. species have considerable genome size of more than 7 Gbp, and there was no whole-genome reference sequence published in this genus. This raises difficulties in acquiring nuclear genome data for its phylogenetic analysis. Previous studies based on Sanger sequencing data, plastid genome data, and nrDNA sequences did not well resolve the phylogeny of . In this study, we used genome skimming and transcriptome data to assemble the plastid genome sequences, nuclear single nucleotide polymorphisms (SNPs) datasets, and single-copy nuclear orthologous genes (SCOGs) to reconstruct the phylogenetic backbone of , and test effectiveness of these genome partitioning methods. We also further analyzed the discordance among nuclear gene trees and between plastid and nuclear phylogenies. The results showed that the SCOGs datasets, assembled from transcriptome method, well resolved the phylogenetic backbone of . The nuclear SNPs datasets from genome skimming method can also produce similar results with the SCOGs data. In contrast to the plastid phylogeny, the phylogeny resolved by nuclear genome data is more robust and better corresponds to morphological characters. Our results suggested that rapid species radiation may have generated high level of incomplete lineage sorting, which was the major cause of nuclear gene discordance. Our simulation also showed that there may have been frequent interspecific hybridization events, which led to some of the cyto-nuclear discordances in . This study not only provides the first robust phylogenetic backbone of based on nuclear genome data, but also provides suggestions of genome partitioning strategies for the phylogenomic study of other plant taxa.
是毛茛科最大的属之一,存在许多有待解决的系统发育问题。该属物种的基因组大小相当可观,超过7 Gbp,且此前没有该属的全基因组参考序列发表。这给获取其核基因组数据用于系统发育分析带来了困难。以往基于桑格测序数据、质体基因组数据和nrDNA序列的研究未能很好地解决该属的系统发育问题。在本研究中,我们使用基因组浅层测序和转录组数据来组装质体基因组序列、核单核苷酸多态性(SNP)数据集和单拷贝核直系同源基因(SCOG),以重建该属的系统发育主干,并测试这些基因组划分方法的有效性。我们还进一步分析了核基因树之间以及质体和核系统发育之间的不一致性。结果表明,通过转录组方法组装的SCOG数据集很好地解决了该属的系统发育主干。基因组浅层测序方法得到的核SNP数据集也能产生与SCOG数据相似的结果。与质体系统发育相比,由核基因组数据解析的系统发育更稳健,且与形态特征更相符。我们的结果表明,快速的物种辐射可能导致了高水平的不完全谱系分选,这是核基因不一致的主要原因。我们的模拟还表明,可能存在频繁的种间杂交事件,这导致了该属中一些细胞核与细胞质的不一致。本研究不仅基于核基因组数据提供了首个可靠的该属系统发育主干,还为其他植物类群的系统基因组学研究提供了基因组划分策略的建议。