Rutkowska Aleksandra, Kolinski Andrzej
Faculty of Chemistry, Warsaw University, Pasteura 1, 02-093 Warsaw, Poland.
Biomacromolecules. 2007 Nov;8(11):3519-24. doi: 10.1021/bm7007718. Epub 2007 Oct 12.
It is known that larger globular proteins are built from domains, relatively independent structural units. A domain size seems to be limited, and a single domain consists of from few tens to a couple of hundred amino acids. Based on Monte Carlo simulations of a reduced protein model restricted to the face centered simple cubic lattice, with a minimal set of short-range and long-range interactions, we have shown that some model sequences upon the folding transition spontaneously divide into separate domains. The observed domain sizes closely correspond to the sizes of real protein domains. Short chains with a proper sequence pattern of the hydrophobic and polar residues undergo a two-state folding transition to the structurally ordered globular state, while similar longer sequences follow a multistate transition. Homopolymeric (uniformly hydrophobic) chains and random heteropolymers undergo a continuous collapse transition into a single globule, and the globular state is much less ordered. Thus, the factors responsible for the multidomain structure of proteins are sufficiently long polypeptide chain and characteristic, protein-like, sequence patterns. These findings provide some hints for the analysis of real sequences aimed at prediction of the domain structure of large proteins.
众所周知,较大的球状蛋白质是由结构域构建而成的,结构域是相对独立的结构单元。一个结构域的大小似乎是有限的,单个结构域由几十到几百个氨基酸组成。基于对限制在面心立方简单晶格上的简化蛋白质模型进行的蒙特卡罗模拟,该模型具有一组最小的短程和长程相互作用,我们已经表明,一些模型序列在折叠转变时会自发地分裂成独立的结构域。观察到的结构域大小与真实蛋白质结构域的大小密切对应。具有适当疏水和极性残基序列模式的短链经历两态折叠转变,形成结构有序的球状状态,而类似的较长序列则遵循多态转变。均聚物(均匀疏水)链和无规杂聚物经历连续的塌缩转变,形成单个小球,并且球状状态的有序程度要低得多。因此,导致蛋白质多结构域结构的因素是足够长的多肽链和特征性的、类似蛋白质的序列模式。这些发现为分析真实序列以预测大蛋白质的结构域结构提供了一些线索。