Sun Ming-An, Wang Yejun, Zhang Qing, Xia Yiji, Ge Wei, Guo Dianjing
School of Life Sciences and the State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, SAR, China.
Department of Cell Biology and Genetics, School of Basic Medical Sciences, Shenzhen University Health Science Center, Nanhai Ave 3688, Shenzhen, 518060, China.
BMC Genomics. 2017 Apr 4;18(1):279. doi: 10.1186/s12864-017-3668-8.
Disulfide bonds are traditionally considered to play only structural roles. In recent years, increasing evidence suggests that the disulfide proteome is made up of structural disulfides and reversible disulfides. Unlike structural disulfides, reversible disulfides are usually of important functional roles and may serve as redox switches. Interestingly, only specific disulfide bonds are reversible while others are not. However, whether reversible disulfides can be predicted based on structural information remains largely unknown.
In this study, two datasets with both types of disulfides were compiled using independent approaches. By comparison of various features extracted from the local structural signatures, we identified several features that differ significantly between reversible and structural disulfides, including disulfide bond length, along with the number, amino acid composition, secondary structure and physical-chemical properties of surrounding amino acids. A SVM-based classifier was developed for predicting reversible disulfides. RESULTS: By 10-fold cross-validation, the model achieved accuracy of 0.750, sensitivity of 0.352, specificity of 0.953, MCC of 0.405 and AUC of 0.751 using the RevSS_PDB dataset. The robustness was further validated by using RevSS_RedoxDB as independent testing dataset. This model was applied to proteins with known structures in the PDB database. The results show that one third of the predicted reversible disulfide containing proteins are well-known redox enzymes, while the remaining are non-enzyme proteins. Given that reversible disulfides are frequently reported from functionally important non-enzyme proteins such as transcription factors, the predictions may provide valuable candidates of novel reversible disulfides for further experimental investigation.
This study provides the first comparative analysis between the reversible and the structural disulfides. Distinct features remarkably different between these two groups of disulfides were identified, and a SVM-based classifier for predicting reversible disulfides was developed accordingly. A web server named RevssPred can be accessed freely from: http://biocomputer.bio.cuhk.edu.hk/RevssPred .
二硫键传统上被认为仅起结构作用。近年来,越来越多的证据表明二硫键蛋白质组由结构二硫键和可逆二硫键组成。与结构二硫键不同,可逆二硫键通常具有重要的功能作用,并且可能充当氧化还原开关。有趣的是,只有特定的二硫键是可逆的,而其他的则不是。然而,基于结构信息能否预测可逆二硫键在很大程度上仍然未知。
在本研究中,使用独立的方法汇编了包含两种类型二硫键的两个数据集。通过比较从局部结构特征中提取的各种特征,我们确定了可逆二硫键和结构二硫键之间存在显著差异的几个特征,包括二硫键长度以及周围氨基酸的数量、氨基酸组成、二级结构和物理化学性质。开发了一种基于支持向量机的分类器来预测可逆二硫键。
通过10倍交叉验证,使用RevSS_PDB数据集时,该模型的准确率为0.750,灵敏度为0.352,特异性为0.953,马修斯相关系数为0.405,曲线下面积为0.751。通过使用RevSS_RedoxDB作为独立测试数据集进一步验证了其稳健性。该模型应用于PDB数据库中具有已知结构的蛋白质。结果表明,预测的含可逆二硫键的蛋白质中有三分之一是众所周知的氧化还原酶,而其余的是非酶蛋白。鉴于可逆二硫键经常在功能重要的非酶蛋白(如转录因子)中报道,这些预测可能为进一步的实验研究提供新的可逆二硫键的有价值候选物。
本研究首次对可逆二硫键和结构二硫键进行了比较分析。确定了这两组二硫键之间明显不同的特征,并相应地开发了一种基于支持向量机的预测可逆二硫键的分类器。可以从以下网址免费访问名为RevssPred的网络服务器:http://biocomputer.bio.cuhk.edu.hk/RevssPred 。