He Fei, Li Jingyi, Wang Rui, Zhao Xiaowei, Han Ye
School of Information Science and Technology, Northeast Normal University, Changchun, 130117, China.
Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China.
BMC Bioinformatics. 2021 Oct 24;22(1):519. doi: 10.1186/s12859-021-04445-5.
Several computational tools for predicting protein Ubiquitylation and SUMOylation sites have been proposed to study their regulatory roles in gene location, gene expression, and genome replication. However, existing methods generally rely on feature engineering, and ignore the natural similarity between the two types of protein translational modification. This study is the first all-in-one deep network to predict protein Ubiquitylation and SUMOylation sites from protein sequences as well as their crosstalk sites simultaneously. Our deep learning architecture integrates several meta classifiers that apply deep neural networks to protein sequence information and physico-chemical properties, which were trained on multi-label classification mode for simultaneously identifying protein Ubiquitylation and SUMOylation as well as their crosstalk sites.
The promising AUCs of our method on Ubiquitylation, SUMOylation and crosstalk sites achieved 0.838, 0.888, and 0.862 respectively on tenfold cross-validation. The corresponding APs reached 0.683, 0.804 and 0.552, which also validated our effectiveness.
The proposed architecture managed to classify ubiquitylated and SUMOylated lysine residues along with their crosstalk sites, and outperformed other well-known Ubiquitylation and SUMOylation site prediction tools.
已经提出了几种用于预测蛋白质泛素化和类泛素化修饰位点的计算工具,以研究它们在基因定位、基因表达和基因组复制中的调控作用。然而,现有方法通常依赖于特征工程,并且忽略了这两种蛋白质翻译后修饰之间的天然相似性。本研究是首个用于同时从蛋白质序列预测蛋白质泛素化和类泛素化修饰位点及其相互作用位点的一体化深度网络。我们的深度学习架构集成了多个元分类器,这些分类器将深度神经网络应用于蛋白质序列信息和物理化学性质,并在多标签分类模式下进行训练,以同时识别蛋白质泛素化和类泛素化修饰位点及其相互作用位点。
在十折交叉验证中,我们的方法在泛素化、类泛素化修饰位点和相互作用位点上的AUC分别达到了0.838、0.888和0.862。相应的AP分别达到0.683、0.804和0.552,这也验证了我们方法的有效性。
所提出的架构成功地对泛素化和类泛素化修饰的赖氨酸残基及其相互作用位点进行了分类,并且优于其他知名的泛素化和类泛素化修饰位点预测工具。