Yang Jinhong, Jiang Wei, Huang Jiayue, Peng Yunwu, Yang Fang, Kong Weiqing
Shaanxi key laboratory of sericulture, Ankang University, Ankang, 725000, China.
School of Modern Agriculture & Biotechnology, Ankang University, Ankang, 725000, China.
BMC Genomics. 2025 Jul 1;26(1):614. doi: 10.1186/s12864-025-11823-3.
Mulberry ( L.) is an economically important plant with abundant resources distributed around the world. Mulberries now have different breeding directions and uses of leaves or fruits, and additionally wild mulberries in natural environments. The classification of mulberry species is inconsistent, as previous phylogenetic and taxonomic studies using morphological and molecular markers have yielded low resolution. Additionally, the phylogenetic relationships among the three uses haven’t been studied to date. Herein, 25 plants representing the three uses were selected, their chloroplast (cp.) genomes were sequenced and analyzed, including cp. genome structure, features and phylogenetic relationships using cp. genomes and SSR markers.
The cp. genome sequences of 25 plants were determined to be 159,008–159,346 bp in length, which included a pair of inverted repeats of 25,654, 25,678 or 25,690 bp that separated a large single copy region of 87,912–88,142 bp and a small single copy region of 19,740–19,853 bp. Annotation showed that they contained a total of 129 genes: 84 protein-coding, 37 tRNA and eight rRNA genes. Comparative genomics showed that the cp. sequence among was relatively conserved, while there were still some high variation regions, and SSR loci with length or nucleotide content polymorphism. The phylogenetic trees constructed using maximum likelihood analysis and Bayesian inference method based on the cp. genome sequences, and the Neighbor-Joining dendrogram based on SSRs, all suggested that the species in this study could be distinguished as three clades, consistent with their uses: leaves, fruits and wild mulberries. In addition, some specific SSRs distributed in the clades may be useful for classification and identification of plants.
This study obtained the cp. genomes of 25 plants and provided the first comparative cp. genome analysis of species categorized by uses, identifying specific SSR markers for classification, which enhanced our comprehension on their classification and phylogenetic relationships.
The online version contains supplementary material available at 10.1186/s12864-025-11823-3.
桑(桑属)是一种具有重要经济价值的植物,资源丰富,分布于世界各地。目前桑树有不同的育种方向以及叶用或果用类型,此外还有自然环境中的野生桑树。桑树物种的分类并不一致,因为先前使用形态学和分子标记的系统发育和分类学研究分辨率较低。此外,这三种用途之间的系统发育关系迄今为止尚未得到研究。在此,选择了代表这三种用途的25种植物,对其叶绿体(cp.)基因组进行测序和分析,包括cp.基因组结构、特征以及利用cp.基因组和SSR标记分析系统发育关系。
25种植物的cp.基因组序列长度确定为159,008 - 159,346 bp,其中包括一对长度为25,654、25,678或25,690 bp的反向重复序列,它们将一个87,912 - 88,142 bp的大单拷贝区域和一个19,740 - 19,853 bp的小单拷贝区域分开。注释显示它们总共包含129个基因:84个蛋白质编码基因、37个tRNA基因和8个rRNA基因。比较基因组学表明,cp.序列在种间相对保守,但仍存在一些高变异区域以及具有长度或核苷酸含量多态性的SSR位点。基于cp.基因组序列使用最大似然分析和贝叶斯推断方法构建的系统发育树,以及基于SSR构建的邻接树状图,均表明本研究中的物种可分为三个分支,与其用途一致:叶用、果用和野生桑树。此外,分布在这些分支中的一些特定SSR可能有助于桑树植物的分类和鉴定。
本研究获得了25种植物的cp.基因组,并首次对按用途分类的桑树物种进行了比较cp.基因组分析,鉴定出用于分类的特定SSR标记,增强了我们对其分类和系统发育关系的理解。
在线版本包含可在10.1186/s12864 - 025 - 11823 - 3获取的补充材料。