Xue Bin, Faraggi Eshel, Zhou Yaoqi
Indiana University School of Informatics, Indiana University-Purdue University, Indianapolis, Indiana 46202, USA.
Proteins. 2009 Jul;76(1):176-83. doi: 10.1002/prot.22329.
A neural network method (SPINE-2D) is introduced to provide a sequence-based prediction of residue-residue contact maps. This method is built on the success of SPINE in predicting secondary structure, residue solvent accessibility, and backbone torsion angles via large-scale training with overfit protection and a two-layer neural network. SPINE-2D achieved a 10-fold cross-validated accuracy of 47% (+/-2%) for top L/5 predicted contacts between two residues with sequence separation of six or more and an accuracy of 24 +/- 1% for nonlocal contacts with sequence separation of 24 residues or more. The accuracies of 23% and 26% for nonlocal contact predictions are achieved for two independent datasets of 500 proteins and 82 CASP 7 targets, respectively. A comparison with other methods indicates that SPINE-2D is among the most accurate methods for contact-map prediction. SPINE-2D is available as a webserver at http://sparks.informatics.iupui.edu.
介绍了一种神经网络方法(SPINE-2D),用于基于序列预测残基-残基接触图。该方法基于SPINE通过大规模训练并采用过拟合保护和两层神经网络成功预测二级结构、残基溶剂可及性和主链扭转角。对于序列间隔为六个或更多的两个残基之间预测的前L/5个接触,SPINE-2D在10倍交叉验证中的准确率为47%(±2%);对于序列间隔为24个残基或更多的非局部接触,准确率为24±1%。对于分别包含500个蛋白质和82个CASP 7目标的两个独立数据集,非局部接触预测的准确率分别为23%和26%。与其他方法的比较表明,SPINE-2D是接触图预测中最准确的方法之一。可通过网络服务器http://sparks.informatics.iupui.edu使用SPINE-2D。