Oliva B, Bates P A, Querol E, Avilés F X, Sternberg M J
Institut de Biologia Fonamental, Universitat Autònoma de Barcelona, Spain.
J Mol Biol. 1997 Mar 7;266(4):814-30. doi: 10.1006/jmbi.1996.0819.
Conformational clusters and consensus sequences for protein loops have been derived by computational analysis of their structures in a non-redundant set of 233 proteins with less than 25% sequence homology (X-ray resolution better than 2.5 A). Loops have been classified into five types (alpha-alpha, beta-beta links, beta-beta hairpins, alpha-beta and beta-alpha) according to the secondary structures they embrace. Four variables have been used to describe the loop geometry, three angles and one distance between the secondary structure elements embracing the loop. Ramachandran angles (phi, psi) are used to define the loop conformations within each brace geometry. All loops from the non-redundant set have been clustered by means of these geometric features. A total of 56 classes (9 alpha-alpha, 11 beta-beta links, 14 beta-beta hairpins, 13 alpha-beta and 9 beta-alpha) were identified with consensus Ramachandran angles in the loops. These classes were divided into subclasses based on the brace geometry. This clustering procedure captures most of the clusters analysed by predominantly visual inspection methods and finds other clusters that have hitherto not been described. Consensus sequence patterns were identified for the subclasses. An extensive characterisation of loop conformations has therefore been achieved and the computational approach is readily open to the incorporation of information from newly determined structures. These clusters should also enhance model building by comparison studies.
通过对一组233个序列同源性低于25%(X射线分辨率优于2.5埃)的非冗余蛋白质结构进行计算分析,得出了蛋白质环的构象簇和共有序列。根据环所包含的二级结构,环被分为五种类型(α-α、β-β连接、β-β发夹、α-β和β-α)。使用四个变量来描述环的几何形状,即三个角度和环所包含的二级结构元件之间的一个距离。拉马钱德兰角(φ,ψ)用于定义每个支撑几何形状内的环构象。通过这些几何特征对非冗余集中的所有环进行了聚类。共识别出56个类别(9个α-α、11个β-β连接、14个β-β发夹、13个α-β和9个β-α),其环中具有共有拉马钱德兰角。这些类别根据支撑几何形状分为子类。这种聚类过程捕获了通过主要的目视检查方法分析的大多数簇,并发现了迄今未描述的其他簇。为这些子类确定了共有序列模式。因此,已经实现了对环构象的广泛表征,并且该计算方法很容易纳入来自新确定结构的信息。这些簇也应通过比较研究增强模型构建。