Solov'ev V V, Salamov V V
Mol Biol (Mosk). 1991 May-Jun;25(3):810-24.
The model of formation of alpha-helices and beta-structures determined by joint action of the three elements: N-terminal, internal and C-terminal fragments are presented. Algorithm for calculation of their localization in a given amino acid sequence was constructed on the base of this model. The preference of the fragments of the amino acid sequence to a definite type of the secondary structure was estimated on the base of corresponding average values of linear discriminant functions dsk (s = alpha, beta, k = N, in, C). The latter were constructed in the previous paper on the base of the revealed significant characteristics. These integral characteristics are used for calculating the localisation of discrete secondary structures. The total prediction for 3 states (alpha, beta, c) given 71% correctly predicted residues (for 4 states alpha, beta, c, t) 62% for the training set, consisting of 72 proteins. For the control set (15 proteins) the accuracy of prediction is about 65%. The essential advantages of this method are: 1) the possibility to localize the discrete secondary structures; 2) the high accuracy of prediction of long secondary structures (for alpha-helices approximately 90%, for beta-structures approximately 80%), which is important for the determination of the protein folding. The influence of mutation on the secondary structure of proteins was investigated. The anormally high stability of the secondary structures of immunoglobulins to mutations was revealed. This probably results from the selection during evolution of such variants of amino acid sequences, which are able to provide the functional variability of antigenic determinants, but keep invariant the tertially structure of protein.
本文提出了由N端、内部和C端片段共同作用决定的α螺旋和β结构的形成模型。基于该模型构建了计算它们在给定氨基酸序列中定位的算法。基于线性判别函数dsk(s = α,β,k = N,in,C)的相应平均值,估计了氨基酸序列片段对特定二级结构类型的偏好。后者是在先前的论文中基于所揭示的显著特征构建的。这些积分特征用于计算离散二级结构的定位。对于由72种蛋白质组成的训练集,3种状态(α,β,c)的总预测准确率为71%(对于4种状态α,β,c,t)为62%。对于对照集(15种蛋白质),预测准确率约为65%。该方法的主要优点是:1)能够定位离散二级结构;2)对长二级结构预测的高精度(对于α螺旋约为90%,对于β结构约为80%),这对确定蛋白质折叠很重要。研究了突变对蛋白质二级结构的影响。发现免疫球蛋白二级结构对突变具有异常高的稳定性。这可能是由于在进化过程中选择了这样的氨基酸序列变体,它们能够提供抗原决定簇的功能变异性,但保持蛋白质三级结构不变。