National Centre for Biomolecular Research, Faculty of Science and CEITEC - Central European Institute of Technology, Masaryk University Brno, Brno-Bohunice, Czech Republic.
J Chem Inf Model. 2011 Aug 22;51(8):1795-806. doi: 10.1021/ci200133w. Epub 2011 Aug 1.
The acid dissociation (ionization) constant pK(a) is one of the fundamental properties of organic molecules. We have evaluated different computational strategies and models to predict the pK(a) values of substituted phenols using partial atomic charges. Partial atomic charges for 124 phenol molecules were calculated using 83 approaches containing seven theory levels (MP2, HF, B3LYP, BLYP, BP86, AM1, and PM3), three basis sets (6-31G*, 6-311G, STO-3G), and five population analyses (MPA, NPA, Hirshfeld, MK, and Löwdin). The correlations between pK(a) and various atomic charge descriptors were examined, and the best descriptors were selected for preparing the quantitative structure-property relationship (QSPR) models. One QSPR model was created for each of the 83 approaches to charge calculation, and then the accuracy of all these models was analyzed and compared. The pK(a)s predicted by most of the models correlate strongly with experimental pK(a) values. For example, more than 25% of the models have correlation coefficients (R²) greater than 0.95 and root-mean-square errors smaller than 0.49. All seven examined theory levels are applicable for pK(a) prediction from charges. The best results were obtained for the MP2 and HF level of theory. The most suitable basis set was found to be 6-31G*. The 6-311G basis set provided slightly weaker correlations, and unexpectedly also, the STO-3G basis set is applicable for the QSPR modeling of pK(a). The Mulliken, natural, and Löwdin population analyses provide accurate models for all tested theory levels and basis sets. The results provided by the Hirshfeld population analysis were also acceptable, but the QSPR models based on MK charges show only weak correlations.
酸离解(电离)常数 pK(a) 是有机分子的基本性质之一。我们评估了不同的计算策略和模型,使用部分原子电荷来预测取代酚的 pK(a) 值。使用 83 种方法计算了 124 个苯酚分子的部分原子电荷,其中包含 7 种理论水平(MP2、HF、B3LYP、BLYP、BP86、AM1 和 PM3)、3 种基组(6-31G*、6-311G、STO-3G)和 5 种电荷分析方法(MPA、NPA、Hirshfeld、MK 和 Löwdin)。研究了 pK(a)与各种原子电荷描述符之间的相关性,并选择了最佳描述符来制备定量构效关系(QSPR)模型。为每个电荷计算方法计算了一个 QSPR 模型,然后分析和比较了所有这些模型的准确性。大多数模型预测的 pK(a)与实验 pK(a)值密切相关。例如,超过 25%的模型的相关系数(R²)大于 0.95,均方根误差小于 0.49。检验的 7 种理论水平都适用于从电荷预测 pK(a)。MP2 和 HF 理论水平的结果最好。最合适的基组被发现是 6-31G*。6-311G 基组提供了稍弱的相关性,出人意料的是,STO-3G 基组也适用于 pK(a)的 QSPR 建模。Mulliken、自然和 Löwdin 电荷分析提供了适用于所有测试理论水平和基组的准确模型。Hirshfeld 电荷分析的结果也可以接受,但基于 MK 电荷的 QSPR 模型仅显示出较弱的相关性。