Hartig Gerald R S, Tran Tran T, Smythe Mark L
Institute for Molecular Bioscience, University of Queensland, St. Lucia 4072, Brisbane, Australia.
Protein Sci. 2005 Feb;14(2):474-82. doi: 10.1110/ps.04923305.
The presence and location of intramolecular disulphide bonds are a key determinant of the structure and function of proteins. Intramolecular disulphide bonds in proteins have previously been analyzed under the assumption that there is no clear relationship between disulphide arrangement and disulphide concentration. To investigate this, a set of sequence nonhomologous protein chains containing one or more intramolecular disulphide bonds was extracted from the Protein Data Bank, and the arrangements of the bonds, Protein Data Bank header, and Structural Characterization of Proteins fold were analyzed as a function of intramolecular disulphide bond concentration. Two populations of intramolecular disulphide bond-containing proteins were identified, with a naturally occurring partition at 25 residues per bond. These populations were named intramolecular disulphide bond-rich and -poor. Benefits of partitioning were illustrated by three results: (1) rich chains most frequently contained three disulphides, explaining the plateaux in extant disulphide frequency distributions; (2) a positive relationship between median chain length and the number of disulphides, only seen when the data were partitioned; and (3) the most common bonding pattern for chains with three disulphide bonds was based on the most common for two, only when the data were partitioned. The two populations had different headers, folds, bond arrangements, and chain lengths. Associations between IDSB concentration, IDSB bonding pattern, loop sizes, SCOP fold, and PDB header were also found. From this, we found that intramolecular disulphide bond-rich and -poor proteins follow different bonding rules, and must be considered separately to generate meaningful models of bond formation.
分子内二硫键的存在和位置是蛋白质结构和功能的关键决定因素。此前在分析蛋白质中的分子内二硫键时,假定二硫键排列与二硫键浓度之间没有明确关系。为了对此进行研究,从蛋白质数据库中提取了一组含有一个或多个分子内二硫键的序列非同源蛋白质链,并将这些键的排列、蛋白质数据库头部信息以及蛋白质折叠的结构特征作为分子内二硫键浓度的函数进行分析。鉴定出了两类含有分子内二硫键的蛋白质,每键25个残基处存在自然划分。这些类别被命名为富含分子内二硫键和贫含分子内二硫键的蛋白质。划分的益处通过三个结果得以体现:(1)富含二硫键的链最常含有三个二硫键,这解释了现存二硫键频率分布中的平台现象;(2)只有在对数据进行划分时,才会看到中位链长与二硫键数量之间存在正相关关系;(3)只有在对数据进行划分时,具有三个二硫键的链最常见的键合模式才基于具有两个二硫键的链最常见的模式。这两类蛋白质具有不同的头部信息、折叠方式、键排列和链长。还发现了分子内二硫键浓度、分子内二硫键键合模式、环大小、结构分类数据库折叠以及蛋白质数据库头部信息之间的关联。由此,我们发现富含分子内二硫键和贫含分子内二硫键的蛋白质遵循不同的键合规则,并且必须分别考虑才能生成有意义的键形成模型。