Xiao Xu, Chen Juyan, Ran Zhaohui, Huang Lang, Li Zhi
Key Laboratory of National Forestry and Grassland Administration on Biodiversity Conservation in Karst Mountainous Areas of Southwestern China, Guizhou Academy of Forestry, Guiyang 550005, China.
College of Forestry, Guizhou University, Guiyang 550025, China.
Genes (Basel). 2025 Jan 3;16(1):49. doi: 10.3390/genes16010049.
: Section is the most diverse group in the genus L., and this group of plants has a long history of cultivation in China as popular ornamental flowers and oil plants. Sect. plants present diverse morphological variations and complexity among species, resulting in uncertainty in the classification of species, which has resulted in a degree of inconvenience and confusion in the use of plant resources and research. : Here, We sequenced and assembled the chloroplast genomes of 6 sect. a and performed comparative chloroplast genome analysis and phylogenetic studies combined with 15 existing sect. plants. : The chloroplast genome of 21 species in sect. species were quadripartite with length of 156,587-157,068 bp base pairs (bp), and a highly conserved and moderately differentiated chloroplast genome arrangement. The 21 sect. chloroplast genomes were similar to those of angiosperms, with high consistency in gene number, gene content and gene structure. After the annotation process, we identified a total of 132 genes, specifically 87 sequences coding for proteins (CDS), 37 transfer RNA (tRNA) genes, and 8 ribosomal RNA (rRNA) genes. The 1 gene in 21 species of the sect. was present only in the small single-copy/inverted repeat of a (SSC/IRa) region. Sequence variation was greater in the large single-copy (LSC) region than in the IR region, and the majority of the protein-coding genes presented high codon preferences. The chloroplast genomes of 21 plant species exhibit relatively conserved SC (single copy region)/IR (inverted repeat region) boundaries. We detected a total of 2975 single sequence repeats (SSRs) as well as 833 dispersed nuclear elements (INEs). Among these SSRs, A/T repeats and AT/AT repeats dominated, while among INEs, forward repeats and palindromic repeats predominated. Codon usage frequencies were largely similar, with 30 high-frequency codons detected. Comparative analysis revealed five hotspot regions (16, J, 33, 8, and 16) and two gene intervals (H-I and D-A) in the cp genome, which can be used as potential molecular markers. In addition, the phylogenetic tree constructed from the chloroplast genome revealed that these 21 species and aggregated into a single branch, which was further subdivided into two evolutionarily independent sub-branches. : It was confirmed that sect. and Abel are closely related in Camellia genus. These findings will enhance our knowledge of the sect. of plants, deepen our understanding of their genetic characteristics and phylogenetic pathways, and provide strong support for the scientific development and rational utilization of the plant resources of the sect. .
组是山茶属中最多样化的类群,该类群植物在中国作为流行的观赏花卉和油料植物有着悠久的栽培历史。组植物在物种间呈现出多样的形态变异和复杂性,导致物种分类存在不确定性,这在植物资源利用和研究中造成了一定程度的不便和混乱。在此,我们对6种组植物的叶绿体基因组进行了测序和组装,并结合15种现有的组植物进行了叶绿体基因组比较分析和系统发育研究。组21个物种的叶绿体基因组呈四分体结构,长度为156,587 - 157,068碱基对(bp),叶绿体基因组排列高度保守且中度分化。21个组叶绿体基因组与被子植物的相似,在基因数量、基因内容和基因结构上具有高度一致性。经过注释过程,我们共鉴定出132个基因,具体为87个编码蛋白质的序列(CDS)、37个转运RNA(tRNA)基因和8个核糖体RNA(rRNA)基因。组21个物种中的1个基因仅存在于a(SSC/IRa)区域的小单拷贝/反向重复序列中。大单拷贝(LSC)区域的序列变异大于IR区域,且大多数蛋白质编码基因呈现出较高的密码子偏好性。21种植物的叶绿体基因组表现出相对保守的SC(单拷贝区域)/IR(反向重复区域)边界。我们共检测到2975个单序列重复(SSR)以及833个分散核元件(INE)。在这些SSR中,A/T重复和AT/AT重复占主导,而在INE中,正向重复和回文重复占主导。密码子使用频率在很大程度上相似,共检测到30个高频密码子。比较分析揭示了叶绿体基因组中的五个热点区域(16、J、33、8和16)和两个基因间隔(H - I和D - A),可作为潜在的分子标记。此外,由叶绿体基因组构建的系统发育树显示,这21个物种与聚集在一个单分支中,该分支进一步细分为两个进化上独立的子分支。证实了组与阿贝尔组在山茶属中密切相关。这些发现将增进我们对组植物的了解,加深我们对其遗传特征和系统发育途径的认识,并为组植物资源的科学开发和合理利用提供有力支持。