Luo Yin, Jiang Jiulei, Zhu Jiajie, Huang Qiyi, Li Weimin, Wang Ying, Gao Yamin
School of Life Sciences, East China Normal University, Shanghai, China.
School of Computer Science and Engineering, Changshu Institute of Technology, Suzhou, China.
Front Plant Sci. 2022 May 25;13:884903. doi: 10.3389/fpls.2022.884903. eCollection 2022.
Ubiquitination, a widespread mechanism of regulating cellular responses in plants, is one of the most important post-translational modifications of proteins in many biological processes and is involved in the regulation of plant disease resistance responses. Predicting ubiquitination is an important technical method for plant protection. Traditional ubiquitination site determination methods are costly and time-consuming, while computational-based prediction methods can accurately and efficiently predict ubiquitination sites. At present, capsule networks and deep learning are used alone for prediction, and the effect is not obvious. The capsule network reflects the spatial position relationship of the internal features of the neural network, but it cannot identify long-distance dependencies or focus on amino acids in protein sequences or their degree of importance. In this study, we investigated the use of convolutional neural networks and capsule networks in deep learning to design a novel model "Caps-Ubi," first using the one-hot and amino acid continuous type hybrid encoding method to characterize ubiquitination sites. The sequence patterns, the dependencies between the encoded protein sequences and the important amino acids in the captured sequences, were then focused on the importance of amino acids in the sequences through the proposed Caps-Ubi model and used for multispecies ubiquitination site prediction. Through relevant experiments, the proposed Caps-Ubi method is superior to other similar methods in predicting ubiquitination sites.
泛素化是植物中调节细胞反应的一种广泛机制,是许多生物过程中蛋白质最重要的翻译后修饰之一,参与植物抗病反应的调控。预测泛素化是植物保护的一种重要技术方法。传统的泛素化位点测定方法成本高且耗时,而基于计算的预测方法可以准确、高效地预测泛素化位点。目前,胶囊网络和深度学习单独用于预测,效果并不明显。胶囊网络反映了神经网络内部特征的空间位置关系,但它无法识别长距离依赖性,也无法关注蛋白质序列中的氨基酸或其重要程度。在本研究中,我们研究了在深度学习中使用卷积神经网络和胶囊网络来设计一种新型模型“Caps-Ubi”,首先使用独热编码和氨基酸连续类型混合编码方法来表征泛素化位点。然后通过提出的Caps-Ubi模型关注序列中氨基酸的重要性,聚焦编码蛋白质序列之间的序列模式、依赖性以及捕获序列中的重要氨基酸,并将其用于多物种泛素化位点预测。通过相关实验,所提出的Caps-Ubi方法在预测泛素化位点方面优于其他类似方法。