Benham C J, Jafri M S
Department of Biomathematical Sciences, Mount Sinai School of Medicine, New York, New York 10029.
Protein Sci. 1993 Jan;2(1):41-54. doi: 10.1002/pro.5560020105.
This paper examines the topological properties of protein disulfide bonding patterns. First, a description of these patterns in terms of partially directed graphs is developed. The topologically distinct disulfide bonding patterns available to a polypeptide chain containing n disulfide bonds are enumerated, and their symmetry and reducibility properties are examined. The theoretical probabilities are calculated that a randomly chosen pattern of n bonds will have any combination of symmetry and reducibility properties, given that all patterns have equal probability of being chosen. Next, the National Biomedical Research Foundation protein sequence and Brookhaven National Laboratories protein structure (PDB) databases are examined, and the occurrences of disulfide bonding patterns in them are determined. The frequencies of symmetric and/or reducible patterns are found to exceed theoretical predictions based on equiprobable pattern selection. Kauzmann's model, in which disulfide bonds form during random encounters as the chain assumes random coil conformations, finds that bonds are more likely to form with near neighbor cysteines than with remote cysteines. The observed frequencies of occurrence of disulfide patterns are found here to be virtually uncorrelated with the predictions of this alternative random bonding model. These results strongly suggest that disulfide bond pattern formation is not the result of random factors, but instead is a directed process. Finally, the PDB structure database is examined to determine the extrinsic topologies of polypeptides containing disulfide bonds. A complete survey of all structures in the database found no instances in which two loops formed by disulfide bonds within the same polypeptide chain are topologically linked. Similarly, no instances are found in which two loops present on different polypeptide chains in a structure are catenated. Further, no examples of topologically knotted loops occur. In contrast, pseudolinking has been found to be a relatively frequent event. These results show a complete avoidance of nontrivial topological entanglements that is unlikely to be the result of chance events. A hypothesis is presented to account for some of these observations.
本文研究了蛋白质二硫键连接模式的拓扑性质。首先,建立了基于部分有向图的这些模式的描述。列举了含有n个二硫键的多肽链可形成的拓扑不同的二硫键连接模式,并研究了它们的对称性和可约性。计算了随机选择的n个键的模式具有任何对称性和可约性组合的理论概率,前提是所有模式被选择的概率相等。接下来,研究了美国国家生物医学研究基金会蛋白质序列数据库和布鲁克海文国家实验室蛋白质结构(PDB)数据库,并确定了其中二硫键连接模式的出现情况。发现对称和/或可约模式的频率超过了基于等概率模式选择的理论预测。考兹曼模型认为,当链呈现随机卷曲构象时,二硫键在随机碰撞过程中形成,该模型发现二硫键与近邻半胱氨酸形成的可能性比与远程半胱氨酸形成的可能性更大。本文发现观察到的二硫键模式出现频率与这种替代随机结合模型的预测几乎不相关。这些结果强烈表明,二硫键模式的形成不是随机因素的结果,而是一个有导向的过程。最后,研究PDB结构数据库以确定含有二硫键的多肽的外在拓扑结构。对数据库中所有结构的全面调查发现,没有实例表明同一多肽链内由二硫键形成的两个环在拓扑上是相连的。同样,在一个结构中不同多肽链上的两个环也没有出现连环的情况。此外,也没有拓扑打结环的例子。相比之下,假连接已被发现是一个相对频繁的事件。这些结果表明完全避免了非平凡的拓扑缠结,这不太可能是偶然事件的结果。本文提出了一个假设来解释其中的一些观察结果。