Fokas A S, Papatheodorou T S, Kister A E, Gelfand I M
Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge CB3 0WA, United Kingdom.
Proc Natl Acad Sci U S A. 2005 Nov 1;102(44):15851-3. doi: 10.1073/pnas.0507335102. Epub 2005 Oct 25.
For a large class of proteins called sandwich-like proteins (SPs), the secondary structures consist of two beta-sheets packed face-to-face, with each beta-sheet consisting typically of three to five beta-strands. An important step in the prediction of the three-dimensional structure of a SP is the prediction of its supersecondary structure, namely the prediction of the arrangement of the beta-strands in the two beta-sheets. Recently, significant progress in this direction was made, where it was shown that 91% of observed SPs form what we here call "canonical motifs." Here, we show that all canonical motifs can be constructed in a simple manner that is based on thermodynamic considerations and uses certain geometric structures. The number of these structures is much smaller than the number of possible strand arrangements. For instance, whereas for SPs consisting of six strands there exist a priori 900 possible strand arrangements, there exist only five geometric structures. Furthermore, the few motifs that are noncanonial can be constructed from canonical motifs by a simple procedure.
对于一大类被称为类三明治蛋白(SPs)的蛋白质,其二级结构由两个面对面堆积的β折叠组成,每个β折叠通常由三到五条β链组成。预测SP三维结构的一个重要步骤是预测其超二级结构,即预测两个β折叠中β链的排列方式。最近,在这个方向上取得了重大进展,结果表明91%的观察到的SPs形成了我们这里所称的“典型基序”。在这里,我们表明所有典型基序都可以通过一种基于热力学考虑并使用某些几何结构的简单方式构建。这些结构的数量远少于可能的链排列数量。例如,对于由六条链组成的SPs,先验地存在900种可能的链排列方式,而几何结构只有五种。此外,少数非典型基序可以通过一个简单的过程从典型基序构建出来。