Bao Wenzheng, You Zhu-Hong, Huang De-Shuang
Institute of Machine Learning and Systems Biology, School of Electronics and Information Engineering, Tongji University, Shanghai, China.
Xinjiang Technical Institutes of Physics and Chemistry, Chinese Academy of Science, Urumqi 830011, China.
Oncotarget. 2017 Nov 6;8(65):108867-108879. doi: 10.18632/oncotarget.22335. eCollection 2017 Dec 12.
Recently, experiments revealed the pupylation to be a signal for the selective regulation of proteins in several serious human diseases. As one of the most significant post translational modification in the field of biology and disease, pupylation has the ability to playing the key role in the regulation various diseases' biological processes. Meanwhile, effectively identification such type modification will be helpful for proteins to perform their biological functions and contribute to understanding the molecular mechanism, which is the foundation of drug design. The existing algorithms of identification such types of modified sites often have some defects, such as low accuracy and time-consuming. In this research, the pupylation sites' identification model, CIPPN, demonstrates better performance than other existing approaches in this field. The proposed predictor achieves value of 89.12 and value of 0.7949 in 10-fold cross-validation tests in the Pupdb Database (http://cwtung.kmu.edu.tw/pupdb). Significantly, such algorithm not only investigates the sequential, structural and evolutionary hallmarks around pupylation sites but also compares the differences of pupylation from the environmental, conservative and functional characterization of substrates. Therefore, the proposed feature description approach and algorithm results prove to be useful for further experimental investigation of such modification's identification.
最近,实验表明,在几种严重的人类疾病中,泛素样修饰是蛋白质选择性调控的信号。作为生物学和疾病领域最重要的翻译后修饰之一,泛素样修饰能够在各种疾病的生物学过程调控中发挥关键作用。同时,有效识别这种类型的修饰将有助于蛋白质发挥其生物学功能,并有助于理解分子机制,而分子机制是药物设计的基础。现有的识别此类修饰位点的算法往往存在一些缺陷,如准确性低和耗时。在本研究中,泛素样修饰位点识别模型CIPPN在该领域的表现优于其他现有方法。在Pupdb数据库(http://cwtung.kmu.edu.tw/pupdb)的10折交叉验证测试中,所提出的预测器的马修斯相关系数值为89.12,F1值为0.7949。值得注意的是,该算法不仅研究了泛素样修饰位点周围的序列、结构和进化特征,还从底物的环境、保守和功能特征方面比较了泛素样修饰的差异。因此,所提出的特征描述方法和算法结果被证明对这种修饰识别的进一步实验研究是有用的。