Hassan Sameer, Mahalingam Vasantha, Kumar Vanaja
Tuberculosis Research Centre Indian Council of Medical Research, Chennai 600 031, India.
Adv Bioinformatics. 2009;2009:316936. doi: 10.1155/2009/316936. Epub 2010 Feb 1.
Synonymous codon usage of protein coding genes of thirty two completely sequenced mycobacteriophage genomes was studied using multivariate statistical analysis. One of the major factors influencing codon usage is identified to be compositional bias. Codons ending with either C or G are preferred in highly expressed genes among which C ending codons are highly preferred over G ending codons. A strong negative correlation between effective number of codons (Nc) and GC3s content was also observed, showing that the codon usage was effected by gene nucleotide composition. Translational selection is also identified to play a role in shaping the codon usage operative at the level of translational accuracy. High level of heterogeneity is seen among and between the genomes. Length of genes is also identified to influence the codon usage in 11 out of 32 phage genomes. Mycobacteriophage Cooper is identified to be the highly biased genome with better translation efficiency comparing well with the host specific tRNA genes.
运用多变量统计分析方法,对32个完全测序的分枝杆菌噬菌体基因组中蛋白质编码基因的同义密码子使用情况进行了研究。确定影响密码子使用的主要因素之一是组成性偏差。在高表达基因中,以C或G结尾的密码子更受青睐,其中以C结尾的密码子比以G结尾的密码子更受高度偏爱。还观察到密码子有效数量(Nc)与GC3s含量之间存在强烈的负相关,表明密码子使用受基因核苷酸组成的影响。翻译选择也被确定在塑造翻译准确性水平上起作用的密码子使用方面发挥作用。基因组之间和内部存在高度的异质性。还确定基因长度会影响32个噬菌体基因组中11个基因组的密码子使用。分枝杆菌噬菌体库珀被确定为具有高度偏差的基因组,与宿主特异性tRNA基因相比,其翻译效率更高。