Department of Molecular Biology and Biotechnology, Tezpur University, Tezpur, 784028 Assam, India.
Department of Computer Science and Engineering, Tezpur University, Tezpur, 784028 Assam, India.
DNA Res. 2022 Jun 25;29(4). doi: 10.1093/dnares/dsac023.
A common approach to estimate the strength and direction of selection acting on protein coding sequences is to calculate the dN/dS ratio. The method to calculate dN/dS has been widely used by many researchers and many critical reviews have been made on its application after the proposition by Nei and Gojobori in 1986. However, the method is still evolving considering the non-uniform substitution rates and pretermination codons. In our study of SNPs in 586 genes across 156 Escherichia coli strains, synonymous polymorphism in 2-fold degenerate codons were higher in comparison to that in 4-fold degenerate codons, which could be attributed to the difference between transition (Ti) and transversion (Tv) substitution rates where the average rate of a transition is four times more than that of a transversion in general. We considered both the Ti/Tv ratio, and nonsense mutation in pretermination codons, to improve estimates of synonymous (S) and non-synonymous (NS) sites. The accuracy of estimating dN/dS has been improved by considering the Ti/Tv ratio and nonsense substitutions in pretermination codons. We showed that applying the modified approach based on Ti/Tv ratio and pretermination codons results in higher values of dN/dS in 29 common genes of equal reading-frames between E. coli and Salmonella enterica. This study emphasizes the robustness of amino acid composition with varying codon degeneracy, as well as the pretermination codons when calculating dN/dS values.
一种估算蛋白质编码序列中选择作用的强度和方向的常用方法是计算 dN/dS 比值。自 1986 年 Nei 和 Gojobori 提出该方法以来,许多研究人员已经广泛使用该方法,并对其应用进行了许多批判性的评价。然而,考虑到非均匀替换率和终止密码子,该方法仍在不断发展。在对 156 株大肠杆菌中的 586 个基因的 SNPs 进行研究时,我们发现 2 倍简并密码子中的同义多态性高于 4 倍简并密码子中的同义多态性,这可能归因于转换(Ti)和颠换(Tv)替换率之间的差异,通常情况下,转换的平均速率是颠换的四倍。我们考虑了 Ti/Tv 比值和终止密码子中的无义突变,以改进同义(S)和非同义(NS)位点的估计。通过考虑 Ti/Tv 比值和终止密码子中的无义突变,提高了 dN/dS 的估计准确性。我们表明,应用基于 Ti/Tv 比值和终止密码子的修正方法会导致大肠杆菌和沙门氏菌之间的 29 个常见基因的 dN/dS 值升高。本研究强调了在计算 dN/dS 值时,氨基酸组成随密码子简并性的变化以及终止密码子的稳健性。