Faraggi Eshel, Xue Bin, Zhou Yaoqi
Indiana University School of Informatics, Indiana University-Purdue University, Indianapolis, IN 46202, USA.
Proteins. 2009 Mar;74(4):847-56. doi: 10.1002/prot.22193.
This article attempts to increase the prediction accuracy of residue solvent accessibility and real-value backbone torsion angles of proteins through improved learning. Most methods developed for improving the backpropagation algorithm of artificial neural networks are limited to small neural networks. Here, we introduce a guided-learning method suitable for networks of any size. The method employs a part of the weights for guiding and the other part for training and optimization. We demonstrate this technique by predicting residue solvent accessibility and real-value backbone torsion angles of proteins. In this application, the guiding factor is designed to satisfy the intuitive condition that for most residues, the contribution of a residue to the structural properties of another residue is smaller for greater separation in the protein-sequence distance between the two residues. We show that the guided-learning method makes a 2-4% reduction in 10-fold cross-validated mean absolute errors (MAE) for predicting residue solvent accessibility and backbone torsion angles, regardless of the size of database, the number of hidden layers and the size of input windows. This together with introduction of two-layer neural network with a bipolar activation function leads to a new method that has a MAE of 0.11 for residue solvent accessibility, 36 degrees for psi, and 22 degrees for phi. The method is available as a Real-SPINE 3.0 server in http://sparks.informatics.iupui.edu.
本文试图通过改进学习来提高蛋白质残基溶剂可及性和真实值主链扭转角的预测准确性。大多数为改进人工神经网络的反向传播算法而开发的方法仅限于小型神经网络。在此,我们引入一种适用于任何规模网络的引导学习方法。该方法使用一部分权重进行引导,另一部分用于训练和优化。我们通过预测蛋白质的残基溶剂可及性和真实值主链扭转角来演示此技术。在这个应用中,引导因子的设计满足直观条件:对于大多数残基,两个残基在蛋白质序列距离上的分离越大,一个残基对另一个残基结构性质的贡献就越小。我们表明,无论数据库大小、隐藏层数和输入窗口大小如何,引导学习方法在预测残基溶剂可及性和主链扭转角的10折交叉验证平均绝对误差(MAE)方面降低了2 - 4%。这与引入具有双极激活函数的两层神经网络一起,产生了一种新方法,该方法对于残基溶剂可及性的MAE为0.11,对于ψ角为36度,对于φ角为22度。该方法可在http://sparks.informatics.iupui.edu上作为Real - SPINE 3.0服务器使用。