Suppr超能文献

DeepCSO:一种用于预测半胱氨酸S-亚磺酰化位点的深度学习网络方法。

DeepCSO: A Deep-Learning Network Approach to Predicting Cysteine S-Sulphenylation Sites.

作者信息

Lyu Xiaru, Li Shuhao, Jiang Chunyang, He Ningning, Chen Zhen, Zou Yang, Li Lei

机构信息

School of Basic Medicine, Qingdao University, Qingdao, China.

College of Life Sciences, Qingdao University, Qingdao, China.

出版信息

Front Cell Dev Biol. 2020 Dec 1;8:594587. doi: 10.3389/fcell.2020.594587. eCollection 2020.

Abstract

Cysteine S-sulphenylation (CSO), as a novel post-translational modification (PTM), has emerged as a potential mechanism to regulate protein functions and affect signal networks. Because of its functional significance, several prediction approaches have been developed. Nevertheless, they are based on a limited dataset from and there is a lack of prediction tools for the CSO sites of other species. Recently, this modification has been investigated at the proteomics scale for a few species and the number of identified CSO sites has significantly increased. Thus, it is essential to explore the characteristics of this modification across different species and construct prediction models with better performances based on the enlarged dataset. In this study, we constructed several classifiers and found that the long short-term memory model with the word-embedding encoding approach, dubbed LSTM , performs favorably to the traditional machine-learning models and other deep-learning models across different species, in terms of cross-validation and independent test. The area under the receiver operating characteristic (ROC) curve for LSTM ranged from 0.82 to 0.85 for different organisms, which was superior to the reported CSO predictors. Moreover, we developed the general model based on the integrated data from different species and it showed great universality and effectiveness. We provided the on-line prediction service called DeepCSO that included both species-specific and general models, which is accessible through http://www.bioinfogo.org/DeepCSO.

摘要

半胱氨酸S-亚磺酰化(CSO)作为一种新型的翻译后修饰(PTM),已成为调节蛋白质功能和影响信号网络的潜在机制。由于其功能重要性,已经开发了几种预测方法。然而,它们基于有限的数据集,并且缺乏针对其他物种CSO位点的预测工具。最近,已经在蛋白质组学规模上对一些物种的这种修饰进行了研究,并且已鉴定的CSO位点数量显著增加。因此,有必要探索这种修饰在不同物种中的特征,并基于扩大的数据集构建性能更好的预测模型。在本研究中,我们构建了几个分类器,发现采用词嵌入编码方法的长短期记忆模型(称为LSTM )在交叉验证和独立测试方面,在不同物种中比传统机器学习模型和其他深度学习模型表现更优。LSTM 的受试者工作特征(ROC)曲线下面积在不同生物体中为0.82至0.85,优于已报道的CSO预测器。此外,我们基于来自不同物种的整合数据开发了通用模型,它显示出很强的通用性和有效性。我们提供了名为DeepCSO的在线预测服务,其中包括物种特异性模型和通用模型,可通过http://www.bioinfogo.org/DeepCSO访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f6f4/7736615/d427a031a807/fcell-08-594587-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验