Jiang-Ning Song, Wei-Jiang Li, Wen-Bo Xu
The Key Laboratory of Industrial Biotechnology, Ministry of Education, Southern Yangtze University, 170 Huihe Road, Wuxi 214036, China.
J Theor Biol. 2004 Nov 7;231(1):85-95. doi: 10.1016/j.jtbi.2004.06.002.
Based on the 639 non-homologous proteins with 2910 cysteine-containing segments of well-resolved three-dimensional structures, a novel approach has been proposed to predict the disulfide-bonding state of cysteines in proteins by constructing a two-stage classifier combining a first global linear discriminator based on their amino acid composition and a second local support vector machine classifier. The overall prediction accuracy of this hybrid classifier for the disulfide-bonding state of cysteines in proteins has scored 84.1% and 80.1%, when measured on cysteine and protein basis using the rigorous jack-knife procedure, respectively. It shows that whether cysteines should form disulfide bonds depends not only on the global structural features of proteins but also on the local sequence environment of proteins. The result demonstrates the applicability of this novel method and provides comparable prediction performance compared with existing methods for the prediction of the oxidation states of cysteines in proteins.
基于639个具有2910个含半胱氨酸片段的非同源蛋白质的高分辨率三维结构,提出了一种新方法,通过构建一个两阶段分类器来预测蛋白质中半胱氨酸的二硫键结合状态,该分类器结合了基于氨基酸组成的第一全局线性判别器和第二局部支持向量机分类器。当使用严格的留一法在半胱氨酸和蛋白质基础上进行测量时,这种混合分类器对蛋白质中半胱氨酸二硫键结合状态的总体预测准确率分别达到了84.1%和80.1%。这表明半胱氨酸是否应形成二硫键不仅取决于蛋白质的全局结构特征,还取决于蛋白质的局部序列环境。结果证明了这种新方法的适用性,并且与现有蛋白质中半胱氨酸氧化态预测方法相比,具有相当的预测性能。