Rooman M J, Wodak S J
Unité de Conformation des Macromolécules Biologiques, Université Libre de Bruxelles, Belgium.
Biochemistry. 1992 Oct 27;31(42):10239-49. doi: 10.1021/bi00157a010.
It is investigated whether protein segments predicted to have a well-defined conformational preference in the absence of tertiary interactions are conserved in families of homologous proteins. The prediction method follows the procedures of Rooman, M., Kocher, J.-P., and Wodak, S. (preceding paper in this issue). It uses a knowledge-based force field that incorporates only local interactions along the sequence and identifies segments whose lowest energy structure displays a sizable energy gap relative to other computed conformations. In 13 of the protein families and subfamilies considered that are sufficiently homologous to have similar 3D structures, at least one region is consistently predicted as having the same preferred conformation in virtually all family members. These regions are between 4 and 26 residues long. They are often located at chain ends and correspond primarily to segments of secondary structure heavily involved in interactions with the rest of the protein, suggesting that they could act as nuclei around which other parts of the structure would assemble. Experimental data on early folding intermediates or on protein fragments with appreciable structure in aqueous solution are available for more than half of the protein families. Comparison of our results with these data is quite favorable. They reveal that each of the experimentally identified early formed, or independently stable, substructures harbors at least one of the segments consistently predicted as having a preferred conformation by our procedure. The implications of our findings for the conservation of folding pathways in homologous proteins are discussed.
研究了在不存在三级相互作用时预测具有明确构象偏好的蛋白质片段在同源蛋白质家族中是否保守。预测方法遵循罗曼、M.、科赫尔、J.-P.和沃达克、S.(本期前文)的程序。它使用一种基于知识的力场,该力场仅纳入沿序列的局部相互作用,并识别其最低能量结构相对于其他计算构象显示出相当大能量差距的片段。在所考虑的13个蛋白质家族和亚家族中,它们具有足够的同源性以具有相似的三维结构,在几乎所有家族成员中,至少有一个区域始终被预测为具有相同的优选构象。这些区域长度在4到26个残基之间。它们通常位于链端,主要对应于大量参与与蛋白质其余部分相互作用的二级结构片段,这表明它们可以作为结构的其他部分围绕其组装的核心。超过一半的蛋白质家族有关于早期折叠中间体或在水溶液中具有明显结构的蛋白质片段的实验数据。将我们的结果与这些数据进行比较非常有利。结果表明,每个通过实验确定的早期形成的或独立稳定的亚结构都包含至少一个通过我们的程序始终被预测为具有优选构象的片段。讨论了我们的发现对同源蛋白质折叠途径保守性的影响。