Biocomputing Group, University of Bologna, CIRI-Life Science and Health Technologies and Department of Biology, Via San Giacomo 9/2, Bologna, Italy.
Bioinformatics. 2011 Aug 15;27(16):2224-30. doi: 10.1093/bioinformatics/btr387. Epub 2011 Jun 29.
Disulfide bonds stabilize protein structures and play relevant roles in their functions. Their formation requires an oxidizing environment and their stability is consequently depending on the redox ambient potential, which may differ according to the subcellular compartment. Several methods are available to predict cysteine-bonding state and connectivity patterns. However, none of them takes into consideration the relevance of protein subcellular localization.
Here we develop DISLOCATE, a two-step method based on machine learning models for predicting both the bonding state and the connectivity patterns of cysteine residues in a protein chain. We find that the inclusion of protein subcellular localization improves the performance of these predictive steps by 3 and 2 percentage points, respectively. When compared with previously developed methods for predicting disulfide bonds from sequence, DISLOCATE improves the overall performance by more than 10 percentage points.
The method and the dataset are available at the Web page http://www.biocomp.unibo.it/savojard/Dislocate.html. GRHCRF code is available at http://www.biocomp.unibo.it/savojard/biocrf.html.
二硫键稳定蛋白质结构,并在其功能中发挥相关作用。它们的形成需要氧化环境,因此它们的稳定性取决于氧化还原环境电势,这可能因亚细胞区室而异。有几种方法可用于预测半胱氨酸键合状态和连接模式。但是,它们都没有考虑到蛋白质亚细胞定位的相关性。
在这里,我们开发了 DISLOCATE,这是一种基于机器学习模型的两步法,用于预测蛋白质链中半胱氨酸残基的键合状态和连接模式。我们发现,将蛋白质亚细胞定位纳入其中可以分别将这些预测步骤的性能提高 3 和 2 个百分点。与以前从序列预测二硫键的方法相比,Dislocate 的整体性能提高了 10 多个百分点。
该方法和数据集可在网页 http://www.biocomp.unibo.it/savojard/Dislocate.html 上获得。GRHCRF 代码可在 http://www.biocomp.unibo.it/savojard/biocrf.html 上获得。