Russell R B, Sasieni P D, Sternberg M J
Biomolecular Modelling Laboratory, Lincoln's Inn Fields, PO Box 123, London WC2A 3PX, UK.
J Mol Biol. 1998 Oct 2;282(4):903-18. doi: 10.1006/jmbi.1998.2043.
A method is presented to assess the significance of binding site similarities within superimposed protein three-dimensional (3D) structures and applied to all similar structures in the Protein Data Bank. For similarities between 3D structures lacking significant sequence similarity, the important distinction was made between remote homology (an ancient common ancestor) and analogy (likely convergence to a folding motif) according to the structural classification of proteins (SCOP) database. Supersites were defined as structural locations on groups of analogous proteins (i.e. superfolds) showing a statistically significant tendency to bind substrates despite little evidence of a common ancestor for the proteins considered. We identify three potentially new superfolds containing supersites: ferredoxin-like folds, four-helical bundles and double-stranded beta helices. In addition, the method quantifies binding site similarities within homologous proteins and previously identified supersites such as that found in the beta/alpha (TIM) barrels. For the nine superfolds, the accuracy of predictions of binding site locations is assessed. Implications for protein evolution, and the prediction of protein function either through fold recognition or tertiary structure comparison, are discussed.
本文提出了一种评估叠加蛋白质三维(3D)结构中结合位点相似性的方法,并将其应用于蛋白质数据库中的所有相似结构。对于缺乏显著序列相似性的3D结构之间的相似性,根据蛋白质结构分类(SCOP)数据库,对远缘同源性(古老的共同祖先)和类比(可能收敛于折叠基序)进行了重要区分。超位点被定义为类比蛋白质组(即超折叠)上的结构位置,尽管所考虑的蛋白质几乎没有共同祖先的证据,但这些位置显示出结合底物的统计学显著趋势。我们识别出三个包含超位点的潜在新超折叠:铁氧化还原蛋白样折叠、四螺旋束和双链β螺旋。此外,该方法还量化了同源蛋白质以及先前识别出的超位点(如在β/α(TIM)桶中发现的超位点)内的结合位点相似性。对于九个超折叠,评估了结合位点位置预测的准确性。讨论了对蛋白质进化的影响,以及通过折叠识别或三级结构比较预测蛋白质功能的问题。