Ycas M
Department of Microbiology and Immunology, State University of New York, Syracuse 13210.
J Protein Chem. 1990 Apr;9(2):177-200. doi: 10.1007/BF01025309.
Using only data on sequence, a method of computing a low-resolution tertiary structure of a protein is described. The steps are: (a) Estimate the distances of individual residues from the centroid of the molecule, using data on hydrophobicity and additional geometrical constraints. (b) Using these distances, construct a two-valued matrix whose elements, the distances between residues, are greater or less than R, the radius of the molecule. (c) Optimize to obtain a three-dimensional structure. This procedure requires modest computing facilities and is applicable to proteins with 164 residues and presumably more. It produces structures with r (correlation between inter-residue distances in the computed and native structures) between 0.5 and 0.7. Furthermore, correct inference of two or three long-range contacts suffices to yield structures with r values of 0.8-0.9. Because segments forming parallel or antiparallel folding structures intersect the radius vector at similar angles, from centroidal point distances it is possible to infer some of these long-range contacts by an elaboration of the procedure used to construct the input matrix. A criterion is also described which can be used to determine the quality of a proposed input matrix even when the native structure is not known.
本文描述了一种仅使用序列数据来计算蛋白质低分辨率三级结构的方法。步骤如下:(a) 利用疏水性数据和其他几何约束条件,估计各个残基到分子质心的距离。(b) 根据这些距离,构建一个二值矩阵,其元素(残基之间的距离)大于或小于分子半径R。(c) 进行优化以获得三维结构。该程序需要适度的计算设备,适用于含有164个残基及可能更多残基的蛋白质。它生成的结构中,计算结构与天然结构之间的残基间距离的相关性r在0.5至0.7之间。此外,正确推断两到三个长程接触足以产生r值为0.8 - 0.9的结构。由于形成平行或反平行折叠结构的片段与半径向量以相似角度相交,根据质心距离,通过对构建输入矩阵所用程序的细化,可以推断出一些此类长程接触。还描述了一种标准,即使在天然结构未知的情况下,也可用于确定所提出的输入矩阵的质量。