Carter Charles W
Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA.
Life (Basel). 2024 Jan 30;14(2):199. doi: 10.3390/life14020199.
How Nature discovered genetic coding is a largely ignored question, yet the answer is key to explaining the transition from biochemical building blocks to life. Other, related puzzles also fall inside the aegis enclosing the codes themselves. The peptide bond is unstable with respect to hydrolysis. So, it requires some form of chemical free energy to drive it. Amino acid activation and acyl transfer are also slow and must be catalyzed. All living things must thus also convert free energy and synchronize cellular chemistry. Most importantly, functional proteins occupy only small, isolated regions of sequence space. Nature evolved heritable symbolic data processing to seek out and use those sequences. That system has three parts: a memory of how amino acids behave in solution and inside proteins, a set of code keys to access that memory, and a scoring function. The code keys themselves are the genes for cognate pairs of tRNA and aminoacyl-tRNA synthetases, AARSs. The scoring function is the enzymatic specificity constant, k/k, which measures both catalysis and specificity. The work described here deepens the evidence for and understanding of an unexpected consequence of ancestral bidirectional coding. Secondary structures occur in approximately the same places within antiparallel alignments of their gene products. However, the polar amino acids that define the molecular surface of one are reflected into core-defining non-polar side chains on the other. Proteins translated from base-paired coding strands fold up inside out. Bidirectional genes thus project an inverted structural duality into the proteome. I review how experimental data root the scoring functions responsible for the origins of coding and catalyzed activation of unfavorable chemical reactions in that duality.
大自然如何发现遗传编码,这一问题在很大程度上被忽视了,然而答案却是解释从生化构件到生命转变的关键。其他相关谜题也在围绕编码本身的保护范围内。肽键在水解方面不稳定。因此,它需要某种形式的化学自由能来驱动。氨基酸活化和酰基转移也很缓慢,必须被催化。所有生物因此也必须转化自由能并同步细胞化学反应。最重要的是,功能性蛋白质仅占据序列空间中很小的、孤立的区域。大自然进化出可遗传的符号数据处理方式来寻找和利用这些序列。该系统有三个部分:关于氨基酸在溶液中和蛋白质内部行为的记忆、一组用于访问该记忆的编码密钥,以及一个评分函数。编码密钥本身就是tRNA和氨酰tRNA合成酶(AARSs)同源对的基因。评分函数是酶促特异性常数k/k,它衡量催化作用和特异性。本文所述工作加深了对祖先双向编码意外后果的证据支持和理解。二级结构在其基因产物的反平行排列中大致出现在相同位置。然而,定义一个分子表面的极性氨基酸在另一个分子上则反映为定义核心的非极性侧链。从碱基配对编码链翻译而来的蛋白质会由内而外折叠。因此,双向基因在蛋白质组中呈现出一种反向的结构二元性。我将回顾实验数据如何将负责编码起源和催化不利化学反应活化的评分函数扎根于这种二元性之中。