Szilágyi A, Györffy D, Závodszky P
Institute of Enzymology, BRC, Hungarian Academy of Sciences Karolina út 29, H-1113 Budapest, Hungary.
Biophys J. 2008 Aug;95(4):1612-26. doi: 10.1529/biophysj.108.131151. Epub 2008 Apr 25.
The amino acid composition of intrinsically disordered proteins and protein segments characteristically differs from that of ordered proteins. This observation forms the basis of several disorder prediction methods. These, however, usually perform worse for smaller proteins (or segments) than for larger ones. We show that the regions of amino acid composition space corresponding to ordered and disordered proteins overlap with each other, and the extent of the overlap (the "twilight zone") is larger for short than for long chains. To explain this finding, we used two-dimensional lattice model proteins containing hydrophobic, polar, and charged monomers and revealed the relation among chain length, amino acid composition, and disorder. Because the number of chain configurations exponentially grows with chain length, a larger fraction of longer chains can reach a low-energy, ordered state than do shorter chains. The amount of information carried by the amino acid composition about whether a protein or segment is (dis)ordered grows with increasing chain length. Smaller proteins rely more on specific interactions for stability, which limits the possible accuracy of disorder prediction methods. For proteins in the "twilight zone", size can determine order, as illustrated by the example of two-state homodimers.
内在无序蛋白质和蛋白质片段的氨基酸组成在特征上不同于有序蛋白质。这一观察结果构成了几种无序预测方法的基础。然而,这些方法通常对较小的蛋白质(或片段)的预测效果比对较大的蛋白质更差。我们表明,与有序和无序蛋白质相对应的氨基酸组成空间区域相互重叠,并且短链的重叠程度(“模糊区域”)比长链更大。为了解释这一发现,我们使用了包含疏水、极性和带电单体的二维晶格模型蛋白质,并揭示了链长、氨基酸组成和无序之间的关系。由于链构象的数量随链长呈指数增长,与较短链相比,较长链中更大比例的链能够达到低能量的有序状态。氨基酸组成所携带的关于蛋白质或片段是否无序的信息量随着链长的增加而增加。较小的蛋白质更多地依赖特定相互作用来维持稳定性,这限制了无序预测方法的可能准确性。对于处于“模糊区域”的蛋白质,大小可以决定有序性,这在两态同型二聚体的例子中得到了说明。