Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore.
BMC Genomics. 2009 Dec 3;10 Suppl 3(Suppl 3):S6. doi: 10.1186/1471-2164-10-S3-S6.
Caspases belong to a class of cysteine proteases which function as critical effectors in cellular processes such as apoptosis and inflammation by cleaving substrates immediately after unique tetrapeptide sites. With hundreds of reported substrates and many more expected to be discovered, the elucidation of the caspase degradome will be an important milestone in the study of these proteases in human health and disease. Several computational methods for predicting caspase cleavage sites have been developed recently for identifying potential substrates. However, as most of these methods are based primarily on the detection of the tetrapeptide cleavage sites - a factor necessary but not sufficient for predicting in vivo substrate cleavage - prediction outcomes will inevitably include many false positives.
In this paper, we show that structural factors such as the presence of disorder and solvent exposure in the vicinity of the cleavage site are important and can be used to enhance results from cleavage site prediction. We constructed a two-step model incorporating cleavage site prediction and these factors to predict caspase substrates. Sequences are first predicted for cleavage sites using CASVM or GraBCas. Predicted cleavage sites are then scored, ranked and filtered against a cut-off based on their propensities for locating in disordered and solvent exposed regions. Using an independent dataset of caspase substrates, the model was shown to achieve greater positive predictive values compared to CASVM or GraBCas alone, and was able to reduce the false positives pool by up to 13% and 53% respectively while retaining all true positives. We applied our prediction model on the family of receptor tyrosine kinases (RTKs) and highlighted several members as potential caspase targets. The results suggest that RTKs may be generally regulated by caspase cleavage and in some cases, promote the induction of apoptotic cell death - a function distinct from their role as transducers of survival and growth signals.
As a step towards the prediction of in vivo caspase substrates, we have developed an accurate method incorporating cleavage site prediction and structural factors. The multi-factor model augments existing methods and complements experimental efforts to define the caspase degradome on the systems-wide basis.
半胱天冬酶属于半胱氨酸蛋白酶家族,通过在独特的四肽位点后立即切割底物,在细胞凋亡和炎症等细胞过程中充当关键效应因子。已经报道了数百种底物,预计还会发现更多的底物,阐明半胱天冬酶降解组将是研究这些蛋白酶在人类健康和疾病中的一个重要里程碑。最近已经开发了几种用于预测半胱天冬酶切割位点的计算方法,以识别潜在的底物。然而,由于这些方法大多主要基于检测四肽切割位点-预测体内底物切割的必要但不充分因素-预测结果不可避免地会包含许多假阳性。
在本文中,我们表明,结构因素,如切割位点附近的无序和溶剂暴露的存在,是重要的,可以用来增强切割位点预测的结果。我们构建了一个两步模型,将切割位点预测和这些因素结合起来预测半胱天冬酶底物。首先使用 CASVM 或 GraBCas 预测序列的切割位点。然后根据它们位于无序和溶剂暴露区域的倾向,对预测的切割位点进行评分、排序和过滤,以达到截止值。使用独立的半胱天冬酶底物数据集,该模型与 CASVM 或 GraBCas 单独使用相比,实现了更高的阳性预测值,同时能够分别减少高达 13%和 53%的假阳性,并保留所有真阳性。我们将我们的预测模型应用于受体酪氨酸激酶 (RTK) 家族,并强调了几个成员作为潜在的半胱天冬酶靶标。结果表明,RTKs 可能通常受到半胱天冬酶切割的调节,在某些情况下,促进诱导细胞凋亡死亡-这一功能与它们作为生存和生长信号转导的功能不同。
作为预测体内半胱天冬酶底物的一个步骤,我们开发了一种准确的方法,将切割位点预测和结构因素结合起来。多因素模型增强了现有方法,并补充了基于系统范围定义半胱天冬酶降解组的实验工作。