Chen Zhiyin, Wang Zixu, Zhou Wei, Liu Sijie, Xiao Yuxin, Gong Yihui
College of Agriculture and Biotechnology, Hunan University of Humanities, Science and Technology, Loudi, Hunan, China.
Key Laboratory of Characteristic Agricultural Resource Development and Quality Safety Control in Hunan Province, Hunan University of Humanities, Science and Technology, Loudi, Hunan, China.
Front Plant Sci. 2025 Jun 13;16:1604404. doi: 10.3389/fpls.2025.1604404. eCollection 2025.
This study reports for the first time the complete sequence characteristics of the mitochondrial genome of the tea plant cultivar Camellia sinensis cv. 'Baihaozao'. It systematically unveils its multi-chromosomal structure, RNA editing patterns, and adaptive evolutionary mechanisms, providing critical theoretical insights into the structural complexity and evolutionary mechanisms of the tea plant mitochondrial genome.
The mitochondrial genome was fully analyzed using genome sequencing and annotation techniques. RNA editing sites were predicted to evaluate editing patterns. Codon usage bias analysis was conducted to identify high-frequency codons. Repeat sequence analysis was used to characterize dispersed and tandem repeats. Adaptive evolutionary analysis, based on Ka/Ks ratios, was performed to investigate gene selection pressures.
The mitochondrial genome consists of 11 linear chromosomes, with a total length of 909,843 bp and a GC content of 45.62%. A total of 73 functional genes were annotated, among which 14 variable genes (e.g., ribosomal protein coding genes) retain intact functions without pseudogenization, which is rare among Theaceae plants. RNA editing site prediction revealed significant spatial heterogeneity, with the cox1 gene being a hotspot containing 19 editing sites. Approximately 58.49% of editing events were concentrated on the second base of codons, and 48.61% of the sites resulted in amino acid changes from hydrophilic to hydrophobic. Codon usage bias analysis showed significant enrichment of high-frequency codons, including UUU (phenylalanine), AUU (isoleucine), and UUC (phenylalanine). The genome's repeat sequences were predominantly dispersed repeats (70.6%), with forward and palindromic repeats of 30-40 bp being dominant. Tandem repeats exhibited significant distribution heterogeneity among chromosomes. Adaptive evolution analysis showed that most PCGs (protein-coding genes) had Ka/Ks ratios below 1 (ranging from 0.07 to 0.78), with the atp9 gene showing the lowest ratio (0.07), while the mttB gene exhibited a significantly higher Ka/Ks ratio of 3.48. Additionally, 1.62% of the mitochondrial genome sequence was homologous to the chloroplast genome, carrying 26 complete functional genes, including 15 tRNA and 2 rRNA genes.
Codon usage bias may be related to mutation pressure due to the high AT content of the genome or reflect adaptive selection pressures for translational efficiency. The Ka/Ks results align with the widespread purifying selection observed in mitochondrial genomes, while the high Ka/Ks ratio of the mttB gene suggests it might be under positive selection to adapt to environmental pressures. The evolutionary evidence of inter-organelle gene transfer highlights the homologous sequences between mitochondria and chloroplasts. Overall, these findings systematically elucidate the adaptive evolutionary mechanisms and functional regulation of the tea plant mitochondrial genome.
本研究首次报道了茶树品种‘白毫早’线粒体基因组的完整序列特征。系统地揭示了其多染色体结构、RNA编辑模式和适应性进化机制,为茶树线粒体基因组的结构复杂性和进化机制提供了关键的理论见解。
使用基因组测序和注释技术对线粒体基因组进行全面分析。预测RNA编辑位点以评估编辑模式。进行密码子使用偏好性分析以识别高频密码子。使用重复序列分析来表征分散重复和串联重复。基于Ka/Ks比值进行适应性进化分析,以研究基因选择压力。
线粒体基因组由11条线性染色体组成,全长909,843 bp,GC含量为45.62%。共注释了73个功能基因,其中14个可变基因(如核糖体蛋白编码基因)保留完整功能且无假基因化现象,这在山茶科植物中较为罕见。RNA编辑位点预测显示出显著的空间异质性,cox1基因是一个热点,包含19个编辑位点。约58.49%的编辑事件集中在密码子的第二位,48.61%的位点导致氨基酸从亲水性变为疏水性。密码子使用偏好性分析显示高频密码子显著富集,包括UUU(苯丙氨酸)、AUU(异亮氨酸)和UUC(苯丙氨酸)。基因组的重复序列主要是分散重复(70.6%),30 - 40 bp的正向和回文重复为主。串联重复在染色体间表现出显著的分布异质性。适应性进化分析表明,大多数蛋白质编码基因的Ka/Ks比值低于1(范围为0.07至0.78),atp9基因的比值最低(0.07),而mttB基因的Ka/Ks比值显著较高,为3.48。此外,线粒体基因组序列的1.62%与叶绿体基因组同源,携带26个完整的功能基因,包括15个tRNA和2个rRNA基因。
密码子使用偏好性可能与基因组高AT含量导致的突变压力有关,或反映了对翻译效率的适应性选择压力。Ka/Ks结果与线粒体基因组中普遍存在的纯化选择一致,而mttB基因的高Ka/Ks比值表明它可能受到正选择以适应环境压力。细胞器间基因转移的进化证据突出了线粒体和叶绿体之间的同源序列。总体而言,这些发现系统地阐明了茶树线粒体基因组的适应性进化机制和功能调控。