Huggins David J
Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge , 19 J J Thomson Avenue, Cambridge CB3 0HE, United Kingdom.
J Chem Theory Comput. 2014 Sep 9;10(9):3617-25. doi: 10.1021/ct500415g.
Inhomogeneous fluid solvation theory (IFST) and free energy perturbation (FEP) calculations were performed for a set of 20 solutes to compute the hydration free energies. We identify the weakness of histogram methods in computing the IFST hydration entropy by showing that previously employed histogram methods overestimate the translational and orientational entropies and thus underestimate their contribution to the free energy by a significant amount. Conversely, we demonstrate the accuracy of the k-nearest neighbors (KNN) algorithm in computing these translational and orientational entropies. Implementing the KNN algorithm within the IFST framework produces a powerful method that can be used to calculate free-energy changes for large perturbations. We introduce a new KNN approach to compute the total solute-water entropy with six degrees of freedom, as well as the translational and orientational contributions. However, results suggest that both the solute-water and water-water entropy terms are significant and must be included. When they are combined, the IFST and FEP hydration free energies are highly correlated, with an R(2) of 0.999 and a mean unsigned difference of 0.9 kcal/mol. IFST predictions are also highly correlated with experimental hydration free energies, with an R(2) of 0.997 and a mean unsigned error of 1.2 kcal/mol. In summary, the KNN algorithm is shown to yield accurate estimates of the combined translational-orientational entropy and the novel approach of combining distance metrics that is developed here could be extended to provide a powerful method for entropy estimation in numerous contexts.
针对一组20种溶质进行了非均匀流体溶剂化理论(IFST)和自由能微扰(FEP)计算,以计算水合自由能。我们通过表明先前采用的直方图方法高估了平动熵和取向熵,从而显著低估了它们对自由能的贡献,确定了直方图方法在计算IFST水合熵方面的弱点。相反,我们证明了k近邻(KNN)算法在计算这些平动熵和取向熵方面的准确性。在IFST框架内实现KNN算法产生了一种强大的方法,可用于计算大扰动下的自由能变化。我们引入了一种新的KNN方法来计算具有六个自由度的总溶质 - 水熵以及平动和取向贡献。然而,结果表明溶质 - 水和水 - 水熵项都很重要,必须包括在内。当它们结合时,IFST和FEP水合自由能高度相关,R(2)为0.999,平均无符号差异为0.9 kcal/mol。IFST预测也与实验水合自由能高度相关,R(2)为0.997,平均无符号误差为1.2 kcal/mol。总之,KNN算法被证明能够准确估计组合的平动 - 取向熵,并且这里开发的结合距离度量的新方法可以扩展,以提供一种在众多情况下进行熵估计的强大方法。