Yadav Manoj Kumar, Swati D
Department of Bioinformatics, MMV, Banaras Hindu University, Varanasi-221005, INDIA.
Bioinformation. 2012;8(24):1230-9. doi: 10.6026/97320630081230. Epub 2012 Dec 8.
Codon usage bias (CUB) is a ubiquitous phenomenon that occurs in nearly all organisms. Previous studies of codon bias in Plasmodium species were based on limited datasets. This study is the first to use whole-genome datasets for a comparative genome analysis of six Plasmodium species using CUB and related methods. Codon usage bias, compositional variation in translated amino acid frequency, the effective number of codons, and optimal codons are analyzed for P. falciparum, P. vivax, P. knowlesi, P. berghei, P. chabaudi and P. yoelii. A plot of the effective number of codons versus GC3 shows that their differential codon usage patterns arise from a combination of mutational pressure and translational selection pressure. The increased relative usage of adenine- and thymine-ending optimal codons in highly expressed genes of P. falciparum is the result of stronger composition-biased pressure, whereas the usage of guanine and cytosine bases at the third codon position can be explained by translational selection pressure acting on them. In P. vivax, by contrast, the higher usage of adenine and thymine bases at the third codon position of optimal codons highlights the role of translational selection pressure, in addition to composition-biased mutation pressure, in shaping its codon usage pattern. The frequency of amino acids encoded by AT-ending codons is significantly higher in P. falciparum than in the other Plasmodium species, owing to strong composition-biased mutational pressure. The CUB variation in the three rodent parasites, P. berghei, P. chabaudi and P. yoelii, is strikingly similar to that of P. falciparum. The simian and human malarial parasite P. knowlesi shows variation in codon usage bias similar to that of P. vivax, but closer study reveals differences, which are confirmed by Principal Component Analysis (PCA).
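As an illustration of how a plot of the effective number of codons (Nc) versus GC3 is usually read, the sketch below computes the null-expectation curve proposed by Wright (1990), which gives the Nc expected when codon usage is shaped by GC3 composition alone. The abstract does not state which formula the authors used, so this is an assumption; the function name `expected_enc` is hypothetical. Genes falling well below the curve are commonly interpreted as being under additional pressure such as translational selection.

```python
def expected_enc(gc3: float) -> float:
    """Expected effective number of codons (Nc) for a given GC3 fraction.

    Implements Wright's null curve: Nc = 2 + s + 29 / (s^2 + (1 - s)^2),
    where s is the GC fraction at the third codon position.
    Assumption: this is the standard reference curve, not necessarily
    the exact one used in the study.
    """
    if not 0.0 <= gc3 <= 1.0:
        raise ValueError("gc3 must be a fraction between 0 and 1")
    return 2.0 + gc3 + 29.0 / (gc3 ** 2 + (1.0 - gc3) ** 2)


if __name__ == "__main__":
    # Tabulate the curve at a few GC3 values, from AT-rich to GC-rich.
    for s in (0.0, 0.1, 0.3, 0.5, 0.7, 0.9, 1.0):
        print(f"GC3 = {s:.1f}  expected Nc = {expected_enc(s):.2f}")
```

An AT-rich genome such as that of P. falciparum would sit toward the left end of this curve, where the expected Nc is already low because of composition alone.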
CDS - Coding sequences, GC1 - GC composition at first site of codon, GC2 - GC composition at second site of codon, GC3 - GC composition at third site of codon, Ala - Alanine, Arg - Arginine, Asn - Asparagine, Asp - Aspartic acid, Cys - Cysteine, Gln - Glutamine, Glu - Glutamic acid, Gly - Glycine, His - Histidine, Ile - Isoleucine, Leu - Leucine, Lys - Lysine, Met - Methionine, Phe - Phenylalanine, Pro - Proline, Ser - Serine, Thr - Threonine, Trp - Tryptophan, Tyr - Tyrosine, Val - Valine.
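The GC1, GC2 and GC3 quantities defined above can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the function name `gc_by_position` and the toy sequence are hypothetical, and the sketch counts every codon in the CDS, whereas many tools exclude stop codons and the single-codon amino acids Met and Trp when reporting third-position GC content.

```python
def gc_by_position(cds: str) -> tuple[float, float, float]:
    """Return (GC1, GC2, GC3): the GC fraction at each codon position.

    Assumption: the sequence is in frame starting at its first base.
    Any trailing bases that do not form a complete codon are ignored.
    """
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3  # drop an incomplete final codon
    if usable == 0:
        raise ValueError("sequence must contain at least one full codon")

    fractions = []
    for pos in range(3):
        bases = seq[pos:usable:3]  # every base at this codon position
        gc_count = sum(1 for base in bases if base in "GC")
        fractions.append(gc_count / len(bases))
    return fractions[0], fractions[1], fractions[2]


if __name__ == "__main__":
    # Toy CDS with codons ATG GCC AAA TAA (hypothetical example).
    gc1, gc2, gc3 = gc_by_position("ATGGCCAAATAA")
    print(f"GC1 = {gc1:.2f}, GC2 = {gc2:.2f}, GC3 = {gc3:.2f}")
```

Applied per gene across a genome, the resulting GC3 values provide the horizontal axis of an Nc versus GC3 plot.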