Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, P. R. China.
Brief Bioinform. 2020 Mar 23;21(2):609-620. doi: 10.1093/bib/bby123.
Post-translational modification (PTM)-based regulation can be mediated not only by the modification of a single residue but also by the interplay of different modifications. Accurate prediction of PTM cross-talk is a highly challenging issue and is in its infant stage. Especially, less attention has been paid to the structural preferences (except intrinsic disorder and spatial proximity) of cross-talk pairs and the characteristics of individual residues involved in cross-talk, which may restrict the improvement of the prediction accuracy. Here we report a structure-based algorithm called PCTpred to improve the PTM cross-talk prediction. The comprehensive residue- and residue pair-based features were designed for paired PTM sites at the sequence and structural levels. Through feature selection, we reserved 23 newly introduced descriptors and 3 traditional descriptors to develop a sequence-based predictor PCTseq and a structure-based predictor PCTstr, both of which were integrated to construct our final prediction model. According to pair- and protein-based evaluations, PCTpred yielded area under the curve values of approximately 0.9 and 0.8, respectively. Even when removing the distance preference of samples or using the input of modeled structures, our prediction performance was maintained or moderately reduced. PCTpred displayed stable and reliable improvements over the state-of-the-art methods based on various evaluations. The source code and data set are freely available at https://github.com/Liulab-HZAU/PCTpred or http://liulab.hzau.edu.cn/PCTpred/.
基于翻译后修饰(PTM)的调节不仅可以由单个残基的修饰介导,还可以由不同修饰的相互作用介导。准确预测 PTM 串扰是一个极具挑战性的问题,目前仍处于起步阶段。特别是,人们对串扰对的结构偏好(除了固有无序和空间接近性)以及涉及串扰的单个残基的特征关注较少,这可能限制了预测准确性的提高。在这里,我们报告了一种称为 PCTpred 的基于结构的算法,以提高 PTM 串扰预测的准确性。该算法在序列和结构水平上针对配对 PTM 位点设计了全面的残基和残基对特征。通过特征选择,我们保留了 23 个新引入的描述符和 3 个传统描述符,用于开发基于序列的预测器 PCTseq 和基于结构的预测器 PCTstr,然后将它们集成到我们的最终预测模型中。根据对配对和蛋白质的评估,PCTpred 的曲线下面积分别约为 0.9 和 0.8。即使去除样本的距离偏好或使用建模结构的输入,我们的预测性能也得以保持或适度降低。基于各种评估,PCTpred 显示出比最新方法更稳定可靠的改进。源代码和数据集可在 https://github.com/Liulab-HZAU/PCTpred 或 http://liulab.hzau.edu.cn/PCTpred/ 上免费获取。