Fraga Keith J, Joo Hyun, Tsai Jerry
Department of Chemistry, University of the Pacific, Stockton, California, 95211.
Proteins. 2016 Feb;84(2):201-16. doi: 10.1002/prot.24966. Epub 2015 Dec 22.
One difficult aspect of the protein-folding problem is characterizing the nonspecific interactions that define packing in protein tertiary structure. To better understand tertiary structure, this work extends the knob-socket model by classifying the interactions of a single knob residue packed into a set of contiguous sockets, or a pocket made up of 4 or more residues. The knob-socket construct allows for a symbolic two-dimensional mapping of pockets. The two-dimensional mapping of pockets provides a simple method to investigate the variety of pocket shapes to understand the geometry of protein tertiary surfaces. The diversity of pocket geometries can be organized into groups of pockets that share a common core, which suggests that some interactions in pockets are ancillary to packing. Further analysis of pocket geometries displays a preferred configuration that is right-handed in α-helices and left-handed in β-sheets. The amino acid composition of pockets illustrates the importance of nonpolar amino acids in packing as well as position specificity. As expected, all pocket shapes prefer to pack with hydrophobic knobs; however, knobs are not selective for the pockets they pack. Investigating side-chain rotamer preferences for certain pocket shapes uncovers no strong correlations. These findings allow a simple vocabulary based on knobs and sockets to describe protein tertiary packing that supports improved analysis, design, and prediction of protein structure.
蛋白质折叠问题的一个难点在于表征那些决定蛋白质三级结构中堆积的非特异性相互作用。为了更好地理解三级结构,这项工作通过对单个球形残基与一组连续的插孔(即由4个或更多残基组成的口袋)之间的相互作用进行分类,扩展了“球-窝”模型。“球-窝”结构允许对口袋进行符号化的二维映射。口袋的二维映射提供了一种简单的方法来研究口袋形状的多样性,以了解蛋白质三级表面的几何形状。口袋几何形状的多样性可以组织成具有共同核心的口袋组,这表明口袋中的一些相互作用对于堆积是辅助性的。对口袋几何形状的进一步分析显示了一种优选的构型,在α螺旋中是右手的,在β折叠中是左手的。口袋的氨基酸组成说明了非极性氨基酸在堆积中的重要性以及位置特异性。正如预期的那样,所有口袋形状都倾向于与疏水球形残基堆积;然而,球形残基对它们所堆积的口袋没有选择性。研究某些口袋形状的侧链旋转异构体偏好没有发现强相关性。这些发现允许基于“球”和“窝”建立一个简单的词汇表来描述蛋白质三级堆积,这有助于改进对蛋白质结构的分析、设计和预测。