Sankar Kannan, Jia Kejue, Jernigan Robert L
Bioinformatics and Computational Biology Interdepartmental Program, Iowa State University, Ames, IA 50011.
Roy J. Carver Department of Biochemistry, Biophysics, and Molecular Biology, Iowa State University, Ames, IA 50011.
Proc Natl Acad Sci U S A. 2017 Mar 14;114(11):2928-2933. doi: 10.1073/pnas.1613331114. Epub 2017 Mar 6.
Evaluating protein structures requires reliable free energies with good estimates of both potential energies and entropies. Although there are many demonstrated successes from using knowledge-based potential energies, computing entropies of proteins has lagged far behind. Here we take an entirely different approach and evaluate knowledge-based conformational entropies of proteins based on the observed frequencies of contact changes between amino acids in a set of 167 diverse proteins, each of which has two alternative structures. The results show that charged and polar interactions break more often than hydrophobic pairs. This pattern correlates strongly with the average solvent exposure of amino acids in globular proteins, as well as with polarity indices and the sizes of the amino acids. Knowledge-based entropies are derived by using the inverse Boltzmann relationship, in a manner analogous to the way that knowledge-based potentials have been extracted. Including these new knowledge-based entropies almost doubles the performance of knowledge-based potentials in selecting the native protein structures from decoy sets. Beyond the overall energy-entropy compensation, a similar compensation is seen for individual pairs of interacting amino acids. The entropies in this report have immediate applications for 3D structure prediction, protein model assessment, and protein engineering and design.
评估蛋白质结构需要可靠的自由能,同时对势能和熵都要有良好的估计。尽管基于知识的势能有许多成功的应用案例,但蛋白质熵的计算却远远落后。在此,我们采用了一种截然不同的方法,基于167种不同蛋白质中氨基酸之间接触变化的观测频率,评估基于知识的蛋白质构象熵,其中每种蛋白质都有两种不同的结构。结果表明,带电和极性相互作用比疏水对更容易断裂。这种模式与球状蛋白质中氨基酸的平均溶剂暴露程度、极性指数以及氨基酸大小密切相关。基于知识的熵是通过使用玻尔兹曼逆关系推导出来的,其方式类似于提取基于知识的势能。在从诱饵集中选择天然蛋白质结构时,纳入这些新的基于知识的熵几乎使基于知识的势能的性能提高了一倍。除了整体的能量 - 熵补偿外,对于相互作用的氨基酸对也观察到了类似的补偿。本报告中的熵在三维结构预测、蛋白质模型评估以及蛋白质工程与设计方面有直接应用。