Ji Huiyue, Liu Junnan, Chen Yineng, Yu Xinyi, Luo Chenlu, Sang Luxi, Zhou Jiayu, Liao Hai
School of Life Science and Engineering, Southwest Jiaotong University, Chengdu 610031, China.
Plants (Basel). 2024 Feb 6;13(4):468. doi: 10.3390/plants13040468.
Heat shock protein 20 (HSP20) serves as a chaperone and plays roles in numerous biological processes, but the codon usage bias (CUB) of its genes has remained unexplored. This study identified 140 genes from four cruciferous species, , , , and , that were identified from the Ensembl plants database, and we subsequently investigated their CUB. As a result, the base composition analysis revealed that the overall GC content of genes was below 50%. The overall GC content significantly correlated with the constituents at three codon positions, implying that both mutation pressure and natural selection might contribute to the CUB. The relatively high ENc values suggested that the CUB of the genes in four cruciferous species was relatively weak. Subsequently, ENc exhibited a negative correlation with gene expression levels. Analyses, including ENc-plot analysis, neutral analysis, and PR2 bias, revealed that natural selection mainly shaped the CUB patterns of genes in these species. In addition, a total of 12 optimal codons (ΔRSCU > 0.08 and RSCU > 1) were identified across the four species. A neighbor-joining phylogenetic analysis based on coding sequences (CDS) showed that the 140 genes were strictly and distinctly clustered into 12 subfamilies. Principal component analysis and cluster analysis based on relative synonymous codon usage (RSCU) values supported the fact that the CUB pattern was consistent with the genetic relationship at the gene level and (or) species levels. These results will not only enrich the gene resource but also advance our understanding of the CUB of genes, which may underlie the theoretical basis for exploration of their genetic and evolutionary pattern.
热休克蛋白20(HSP20)作为一种分子伴侣,在众多生物学过程中发挥作用,但其基因的密码子使用偏好(CUB)尚未得到探索。本研究从Ensembl植物数据库中鉴定出十字花科四种植物([此处原文缺失四个物种名称])的140个基因,并随后研究了它们的CUB。结果,碱基组成分析表明,这些基因的总体GC含量低于50%。总体GC含量与三个密码子位置的组成成分显著相关,这意味着突变压力和自然选择可能都对CUB有贡献。相对较高的有效密码子数(ENc)值表明,这四种十字花科植物中这些基因的CUB相对较弱。随后,ENc与基因表达水平呈负相关。包括ENc-作图分析、中性分析和PR2偏差分析在内的分析表明,自然选择主要塑造了这些物种中这些基因的CUB模式。此外,在这四个物种中总共鉴定出12个最优密码子(ΔRSCU > 0.08且RSCU > 1)。基于编码序列(CDS)的邻接法系统发育分析表明,这140个基因被严格且明显地聚类为12个亚家族。基于相对同义密码子使用(RSCU)值的主成分分析和聚类分析支持了这样一个事实,即CUB模式在基因水平和(或)物种水平上与遗传关系一致。这些结果不仅将丰富这些基因资源,还将推进我们对这些基因CUB的理解,这可能为探索其遗传和进化模式奠定理论基础。