Trifonov E N
Institute of Molecular Medical Sciences, Palo Alto, CA 94306.
J Mol Evol. 1994 May;38(5):543-6. doi: 10.1007/BF00178853.
Since 1929 the concept that proteins are built from subunits of a certain standard size (Svedberg 1929) has been revisited several times, each time with a new demonstration that there are indeed certain preferred protein sizes. According to recent estimates, the overrepresented sizes are close to multiples of 125 amino acid (aa) residues for eukaryotes and 150 residues for prokaryotes. To explain these preferences, a hypothesis on the recombinational nature of this regularity is proposed and developed quantitatively. The protein-coding sequences are assumed to have evolved at some early stage via recombinational events: insertions of DNA circles of a certain optimal size. The contour lengths of the protein-coding DNA circles had to be simultaneously divisible by three and, to minimize torsional constraint, by the DNA helical repeat. With these two conditions satisfied, the calculated contour lengths of the DNA circles, 250-500 base pairs (bp), turn out to correspond well to known optimal DNA circularization sizes and to the predicted range of protein sequence subunit sizes, 80-170 aa residues, which covers the experimentally observed values. The subunit size is found to be strongly influenced by the helical repeat of DNA. The sizes 125 and 150 aa are derived when the corresponding helical repeats of DNA are set within fractions of a per mille of the 10.54 bp/turn value. This agrees with the experimentally estimated mean for natural mixed DNA sequences, 10.53-10.57 bp/turn. (ABSTRACT TRUNCATED AT 250 WORDS)
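The two divisibility conditions stated in the abstract can be illustrated with a short enumeration. This is only a sketch under stated assumptions, not the paper's actual calculation: the function name `candidate_circles`, the `tolerance` parameter (allowed deviation from a whole number of helical turns), and the direct mapping of one circle to `length / 3` codons are all hypothetical choices made for illustration. It is not claimed to reproduce the 125 and 150 aa values.

```python
def candidate_circles(helical_repeat=10.54, min_bp=250, max_bp=500, tolerance=0.05):
    """List circle lengths (bp) that are divisible by 3 and close to a whole
    number of helical turns.

    Assumptions (not from the source): `tolerance` is the largest accepted
    deviation, in turns, from an integer number of turns, and each circle is
    read as length/3 codons.
    Returns tuples of (length_bp, codons, nearest_turns, mismatch_in_turns).
    """
    results = []
    for length in range(min_bp, max_bp + 1):
        # Condition 1: the circle must keep the reading frame.
        if length % 3 != 0:
            continue
        # Condition 2: the circle should hold a near-integer number of turns,
        # so that closing it requires little extra twist.
        turns = length / helical_repeat
        nearest = round(turns)
        mismatch = abs(turns - nearest)
        if mismatch <= tolerance:
            results.append((length, length // 3, nearest, mismatch))
    return results


if __name__ == "__main__":
    for length, codons, turns, mismatch in candidate_circles():
        print(f"{length} bp = {codons} codons, ~{turns} turns (mismatch {mismatch:.3f})")
```

As a sanity check on the sketch, a helical repeat of exactly 10.5 bp/turn with a near-zero tolerance leaves only the multiples of 21 bp in the 250-500 bp window, since 21 is the smallest length that is both a multiple of 3 and a whole number of 10.5 bp turns. Small changes in `helical_repeat` shift which lengths pass, which is the kind of sensitivity the abstract describes.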