Department of Biology, University of Science, Unjong-District, Pyongyang, DPR Korea.
Proteins. 2010 Mar;78(4):1026-39. doi: 10.1002/prot.22626.
A major challenge of the protein docking problem is to define scoring functions that can distinguish near-native protein complex geometries from a large number of non-native geometries (decoys) generated with noncomplexed protein structures (unbound docking). In this study, we have constructed a neural network that employs the information from atom-pair distance distributions of a large number of decoys to predict protein complex geometries. We found that docking prediction can be significantly improved using two different types of polar hydrogen atoms. To train the neural network, 2000 near-native decoys of even distance distribution were used for each of the 185 considered protein complexes. The neural network normalizes the information from different protein complexes using an additional protein complex identity input neuron for each complex. The parameters of the neural network were determined such that they mimic a scoring funnel in the neighborhood of the native complex structure. The neural network approach avoids the reference state problem, which occurs in deriving knowledge-based energy functions for scoring. We show that a distance-dependent atom pair potential performs much better than a simple atom-pair contact potential. We have compared the performance of our scoring function with other empirical and knowledge-based scoring functions such as ZDOCK 3.0, ZRANK, ITScore-PP, EMPIRE, and RosettaDock. In spite of the simplicity of the method and its functional form, our neural network-based scoring function achieves a reasonable performance in rigid-body unbound docking of proteins. Proteins 2010. (c) 2009 Wiley-Liss, Inc.
蛋白质对接问题的主要挑战之一是定义评分函数,该函数能够区分来自大量非复杂蛋白质结构(无约束对接)生成的非天然蛋白质复合物几何形状的近天然蛋白质复合物几何形状。在这项研究中,我们构建了一个神经网络,该网络使用大量诱饵的原子对距离分布信息来预测蛋白质复合物的几何形状。我们发现,使用两种不同类型的极性氢原子可以显著提高对接预测的效果。为了训练神经网络,对于所考虑的 185 个蛋白质复合物中的每个复合物,使用了 2000 个具有均匀距离分布的近天然诱饵。神经网络使用每个复合物的附加蛋白质复合物身份输入神经元对来自不同蛋白质复合物的信息进行归一化。确定神经网络的参数,使其在天然复合物结构的邻域中模拟评分漏斗。神经网络方法避免了在为评分导出基于知识的能量函数时出现的参考状态问题。我们表明,距离相关的原子对势比简单的原子对接触势表现要好得多。我们比较了我们的评分函数的性能与其他经验和基于知识的评分函数,如 ZDOCK 3.0、ZRANK、ITScore-PP、EMPIRE 和 RosettaDock。尽管该方法简单且功能形式简单,但我们基于神经网络的评分函数在蛋白质的刚性无约束对接中仍能达到合理的性能。蛋白质 2010.(c)2009 Wiley-Liss,Inc.