Haspel Nurit, Tsai Chung-Jung, Wolfson Haim, Nussinov Ruth
Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
Protein Sci. 2003 Jun;12(6):1177-87. doi: 10.1110/ps.0232903.
Understanding, and ultimately predicting, how a 1-D protein chain reaches its native 3-D fold has been one of the most challenging problems during the last few decades. Data increasingly indicate that protein folding is a hierarchical process. Hence, the question arises as to whether we can use the hierarchical concept to reduce the practically intractable computational times. For such a scheme to work, the first step is to cut the protein sequence into fragments that form local minima on the polypeptide chain. The conformations of such fragments in solution are likely to be similar to those when the fragments are embedded in the native fold, although alternate conformations may be favored during the mutual stabilization in the combinatorial assembly process. Two elements are needed for such cutting: (1) a library of (clustered) fragments derived from known protein structures and (2) an assignment algorithm that selects optimal combinations to "cover" the protein sequence. The next two steps in hierarchical folding schemes, not addressed here, are the combinatorial assembly of the fragments and finally, optimization of the obtained conformations. Here, we address the first step in a hierarchical protein-folding scheme. The input is a target protein sequence and a library of fragments created by clustering building blocks that were generated by cutting all protein structures. The output is a set of cutout fragments. We briefly outline a graph theoretic algorithm that automatically assigns building blocks to the target sequence, and we describe a sample of the results we have obtained.
在过去几十年中,理解并最终预测一维蛋白质链如何形成其天然三维折叠结构一直是最具挑战性的问题之一。越来越多的数据表明,蛋白质折叠是一个分层过程。因此,问题就来了:我们是否可以利用分层概念来减少实际中难以处理的计算时间。要使这样的方案可行,第一步是将蛋白质序列切割成在多肽链上形成局部最小值的片段。这些片段在溶液中的构象可能与它们嵌入天然折叠结构时的构象相似,尽管在组合组装过程中的相互稳定过程中可能更倾向于交替构象。进行这种切割需要两个要素:(1)一个源自已知蛋白质结构的(聚类)片段库,以及(2)一种分配算法,该算法选择最优组合以“覆盖”蛋白质序列。分层折叠方案中的接下来两个步骤(此处未涉及)是片段的组合组装,最后是对所得构象的优化。在这里,我们讨论分层蛋白质折叠方案的第一步。输入是一个目标蛋白质序列和一个通过对所有蛋白质结构进行切割生成的构建块聚类创建的片段库。输出是一组裁剪片段。我们简要概述一种自动将构建块分配给目标序列的图论算法,并描述我们获得的一些结果示例。