Hlevnjak Mario, Zagrovic Bojan
Department of Structural and Computational Biology, Max F. Perutz Laboratories, University of Vienna, Campus Vienna Biocenter 5, 1030 Vienna, Austria.
Department of Structural and Computational Biology, Max F. Perutz Laboratories, University of Vienna, Campus Vienna Biocenter 5, 1030 Vienna, Austria
Nucleic Acids Res. 2015 Mar 31;43(6):3012-21. doi: 10.1093/nar/gkv166. Epub 2015 Mar 8.
It has recently been demonstrated that nucleobase-density profiles of typical mRNA coding sequences exhibit a complementary relationship with nucleobase-interaction propensity profiles of their cognate protein sequences. This finding supports the idea that the genetic code developed in response to direct binding interactions between amino acids and appropriate nucleobases, but also suggests that present-day mRNAs and their cognate proteins may be physicochemically complementary to each other and bind. Here, we computationally recode complete Methanocaldococcus jannaschii, Escherichia coli and Homo sapiens mRNA transcriptomes and analyze how much complementary matching of synonymous mRNAs can vary, while keeping protein sequences fixed. We show that for most proteins there exist cognate mRNAs that improve, but also significantly worsen the level of native matching (e.g. by 1.8 viz. 7.6 standard deviations on average for H. sapiens, respectively), with the least malleable proteins in this sense being strongly enriched in nuclear localization and DNA-binding functions. Even so, we show that the majority of recodings for most proteins result in pronounced complementarity. Our results suggest that the genetic code was designed for favorable, yet tunable compositional complementarity between mRNAs and their cognate proteins, supporting the hypothesis that the interactions between the two were an important defining element behind the code's origin.
最近有研究表明,典型mRNA编码序列的核碱基密度分布与其同源蛋白质序列的核碱基相互作用倾向分布呈现互补关系。这一发现支持了遗传密码是为响应氨基酸与合适核碱基之间的直接结合相互作用而发展起来的观点,但也表明当今的mRNA及其同源蛋白质在物理化学性质上可能相互互补并结合。在此,我们通过计算对完整的詹氏甲烷球菌、大肠杆菌和人类的mRNA转录组进行重新编码,并分析在保持蛋白质序列不变的情况下,同义mRNA的互补匹配程度能有多大差异。我们发现,对于大多数蛋白质而言,存在一些同源mRNA,它们既能提高也能显著降低天然匹配水平(例如,人类的平均分别提高1.8和降低7.6个标准差),从这个意义上说,可塑性最小的蛋白质在核定位和DNA结合功能方面高度富集。即便如此,我们表明大多数蛋白质的大多数重新编码都会导致明显的互补性。我们的结果表明,遗传密码的设计旨在实现mRNA与其同源蛋白质之间良好但可调节的组成互补性,这支持了两者之间的相互作用是密码起源背后重要的决定性因素这一假说。