Li Shihua, Yu Kai, Wu Guandi, Zhang Qingfeng, Wang Panqin, Zheng Jian, Liu Ze-Xian, Wang Jichao, Gao Xinjiao, Cheng Han
State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China.
School of Life Sciences, Zhengzhou University, Zhengzhou, China.
Front Cell Dev Biol. 2021 Feb 23;9:617366. doi: 10.3389/fcell.2021.617366. eCollection 2021.
Thiol groups on cysteines can undergo multiple post-translational modifications (PTMs), acting as a molecular switch to maintain redox homeostasis and regulating a series of cell signaling transductions. Identification of sophistical protein cysteine modifications is crucial for dissecting its underlying regulatory mechanism. Instead of a time-consuming and labor-intensive experimental method, various computational methods have attracted intense research interest due to their convenience and low cost. Here, we developed the first comprehensive deep learning based tool pCysMod for multiple protein cysteine modification prediction, including -nitrosylation, -palmitoylation, -sulfenylation, -sulfhydration, and -sulfinylation. Experimentally verified cysteine sites curated from literature and sites collected by other databases and predicting tools were integrated as benchmark dataset. Several protein sequence features were extracted and united into a deep learning model, and the hyperparameters were optimized by particle swarm optimization algorithms. Cross-validations indicated our model showed excellent robustness and outperformed existing tools, which was able to achieve an average AUC of 0.793, 0.807, 0.796, 0.793, and 0.876 for -nitrosylation, -palmitoylation, -sulfenylation, -sulfhydration, and -sulfinylation, demonstrating pCysMod was stable and suitable for protein cysteine modification prediction. Besides, we constructed a comprehensive protein cysteine modification prediction web server based on this model to benefit the researches finding the potential modification sites of their interested proteins, which could be accessed at http://pcysmod.omicsbio.info. This work will undoubtedly greatly promote the study of protein cysteine modification and contribute to clarifying the biological regulation mechanisms of cysteine modification within and among the cells.
半胱氨酸上的巯基可经历多种翻译后修饰(PTM),充当维持氧化还原稳态的分子开关并调节一系列细胞信号转导。识别复杂的蛋白质半胱氨酸修饰对于剖析其潜在的调控机制至关重要。由于其便利性和低成本,各种计算方法已吸引了广泛的研究兴趣,而不是采用耗时且费力的实验方法。在此,我们开发了首个基于深度学习的综合性工具pCysMod,用于多种蛋白质半胱氨酸修饰预测,包括亚硝基化、棕榈酰化、亚磺化、巯基化和亚砜化。从文献中整理的以及其他数据库和预测工具收集的经过实验验证的半胱氨酸位点被整合为基准数据集。提取了几种蛋白质序列特征并将其整合到一个深度学习模型中,通过粒子群优化算法对超参数进行了优化。交叉验证表明,我们的模型具有出色的稳健性,优于现有工具,对于亚硝基化、棕榈酰化、亚磺化、巯基化和亚砜化,其平均AUC分别能够达到0.793、0.807、0.796、0.793和0.876,表明pCysMod稳定且适用于蛋白质半胱氨酸修饰预测。此外,我们基于此模型构建了一个综合性蛋白质半胱氨酸修饰预测网络服务器,以方便研究人员找到他们感兴趣蛋白质的潜在修饰位点,该服务器可通过http://pcysmod.omicsbio.info访问。这项工作无疑将极大地促进蛋白质半胱氨酸修饰的研究,并有助于阐明细胞内和细胞间半胱氨酸修饰的生物学调控机制。