Department of Computer Science and Engineering.
Center for Network and Data Science.
Bioinformatics. 2020 Dec 8;36(19):4876-4884. doi: 10.1093/bioinformatics/btaa603.
Most amino acids are encoded by multiple synonymous codons, some of which are used more rarely than others. Analyses of positions of such rare codons in protein sequences revealed that rare codons can impact co-translational protein folding and that positions of some rare codons are evolutionarily conserved. Analyses of their positions in protein 3-dimensional structures, which are richer in biochemical information than sequences alone, might further explain the role of rare codons in protein folding.
We model protein structures as networks and use network centrality to measure the structural position of an amino acid. We first validate that amino acids buried within the structural core are network-central, and those on the surface are not. Then, we study potential differences between network centralities, and thus structural positions, of amino acids encoded by conserved rare, non-conserved rare and commonly used codons. We find that in 84% of proteins, the three codon categories occupy significantly different structural positions. We examine protein groups showing different codon centrality trends, i.e. different relationships between structural positions of the three codon categories. In several cases, all proteins in our data that share a given structural or functional property fall into the same group. Conversely, in one case, all proteins in a group share the same property. Our work shows that codon usage is linked to the final protein structure and thus possibly to co-translational protein folding.
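To illustrate the idea of measuring structural position by network centrality, the following is a minimal sketch and not the authors' implementation. The abstract does not state how the networks are built or which centrality measures are used, so everything here is an assumption: residues are linked when one representative coordinate per residue lies within a hypothetical 6.0 Å cutoff, and the function names `contact_network`, `degree_centrality` and `closeness_centrality` are illustrative only.

```python
from collections import deque
from math import dist


def contact_network(coords, cutoff=6.0):
    """Build adjacency sets: residues i and j are linked if their
    coordinates lie within `cutoff` (an assumed value, in angstroms)."""
    adj = {i: set() for i in range(len(coords))}
    for i in range(len(coords)):
        for j in range(i + 1, len(coords)):
            if dist(coords[i], coords[j]) <= cutoff:
                adj[i].add(j)
                adj[j].add(i)
    return adj


def degree_centrality(adj):
    """Fraction of the other residues that each residue is in contact with."""
    n = len(adj)
    return {v: len(neighbors) / (n - 1) for v, neighbors in adj.items()}


def closeness_centrality(adj):
    """Reachable residues divided by the sum of shortest-path lengths to them."""
    result = {}
    for source in adj:
        # Breadth-first search gives shortest path lengths in an unweighted network.
        seen = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for w in adj[u]:
                if w not in seen:
                    seen[w] = seen[u] + 1
                    queue.append(w)
        total = sum(seen.values())
        result[source] = (len(seen) - 1) / total if total else 0.0
    return result


# Toy structure (not real data): one "buried" residue at the origin
# surrounded by six "surface" residues, each 5 angstroms away.
toy = [(0, 0, 0), (5, 0, 0), (-5, 0, 0), (0, 5, 0), (0, -5, 0), (0, 0, 5), (0, 0, -5)]
adj = contact_network(toy)
degree = degree_centrality(adj)
closeness = closeness_centrality(adj)
```

In this toy case the central residue contacts all six others while each outer residue contacts only the center, so the central residue scores higher on both measures. This mirrors the validation step described above, in which buried residues are network-central and surface residues are not.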
https://nd.edu/~cone/CodonUsage/.
Supplementary data are available at Bioinformatics online.