Sanford-Burnham Medical Research Institute, 10901 North Torrey Pines Road, La Jolla, California 92037, USA.
J Proteome Res. 2011 Aug 5;10(8):3642-51. doi: 10.1021/pr200271w. Epub 2011 Jul 8.
Limited or regulatory proteolysis plays a critical role in many important biological pathways like blood coagulation, cell proliferation, and apoptosis. A better understanding of mechanisms that control this process is required for discovering new proteolytic events and for developing inhibitors with potential therapeutic value. Two features that determine the susceptibility of peptide bonds to proteolysis are the sequence in the vicinity of the scissile bond and the structural context in which the bond is displayed. In this study, we assessed statistical significance and predictive power of individual structural descriptors and combination thereof for the identification of cleavage sites. The analysis was performed on a data set of >200 proteolytic events documented in CutDB for a variety of mammalian regulatory proteases and their physiological substrates with known 3D structures. The results confirmed the significance and provided a ranking within three main categories of structural features: exposure > flexibility > local interactions. Among secondary structure elements, the largest frequency of proteolytic cleavage was confirmed for loops and lower but significant frequency for helices. Limited proteolysis has lower albeit appreciable frequency of occurrence in certain types of β-strands, which is in contrast with some previous reports. Descriptors deduced directly from the amino acid sequence displayed only marginal predictive capabilities. Homology-based structural models showed a predictive performance comparable to protein substrates with experimentally established structures. Overall, this study provided a foundation for accurate automated prediction of segments of protein structure susceptible to proteolytic processing and, potentially, other post-translational modifications.
有限的或调节性蛋白水解在许多重要的生物学途径中起着关键作用,如血液凝固、细胞增殖和细胞凋亡。为了发现新的蛋白水解事件,并开发具有潜在治疗价值的抑制剂,需要更好地了解控制这一过程的机制。决定肽键对蛋白水解敏感性的两个特征是靠近裂解键的序列和键显示的结构上下文。在这项研究中,我们评估了单个结构描述符及其组合识别切割位点的统计显著性和预测能力。该分析是在 CutDB 中为各种哺乳动物调节蛋白酶及其具有已知 3D 结构的生理底物记录的>200 个蛋白水解事件的数据集中进行的。结果证实了这些特征的重要性,并在三个主要结构特征类别内进行了排名:暴露>灵活性>局部相互作用。在二级结构元件中,循环的蛋白水解切割频率最大,但螺旋的频率较低,但也很显著。尽管发生频率较低,但有限的蛋白水解在某些类型的β-折叠中也有发生,这与之前的一些报道相反。直接从氨基酸序列推断出的描述符仅显示出微不足道的预测能力。基于同源性的结构模型显示出与具有实验确定结构的蛋白质底物相当的预测性能。总的来说,这项研究为准确自动预测蛋白质结构中易受蛋白水解加工和潜在其他翻译后修饰的片段提供了基础。