Ma Weiping, Zhang Xiaoyun, Luan Feng, Zhang Haixia, Zhang Ruisheng, Liu Mancang, Hu Zhide, Fan B T
Department of Chemistry, Lanzhou University, Lanzhou 730000, China.
J Phys Chem A. 2005 Apr 21;109(15):3485-92. doi: 10.1021/jp0501446.
A new method support vector machine (SVM) and the heuristic method (HM) were used to develop nonlinear and linear models between the solubility in electrolyte containing sodium chloride and three molecular descriptors of 217 nonelectrolytes. The molecular descriptors representing the structural features of the compounds include two topological and one electrostatic descriptor. The three molecular descriptors selected by HM in CODESSA were used as inputs for SVM. The results obtained by HM and SVM both were satisfactory. The model of HM leads to a correlation coefficient (R) of 0.980 and root-mean-square error (RMS) of 0.219 for the test set. The same descriptors were also employed to build the model in pure water, and the prediction results were consistent with the experimental solubilities. Furthermore, a predictive correlation coefficient R = 0.988 and RMS error of 0.170 for the test set were obtained by SVM. The prediction results are in very good agreement with the experimental values. This paper provides a new and effective method for predicting the solubility in electrolyte and reveals some insight into the structural features that are related to the noneletrolytes.
采用一种新的方法——支持向量机(SVM)和启发式方法(HM),建立了217种非电解质在含氯化钠电解质中的溶解度与三个分子描述符之间的非线性和线性模型。代表化合物结构特征的分子描述符包括两个拓扑描述符和一个静电描述符。HM在CODESSA中选择的三个分子描述符用作SVM的输入。HM和SVM得到的结果均令人满意。HM模型对测试集的相关系数(R)为0.980,均方根误差(RMS)为0.219。相同的描述符也用于建立在纯水中的模型,预测结果与实验溶解度一致。此外,SVM对测试集得到的预测相关系数R = 0.988,RMS误差为0.170。预测结果与实验值非常吻合。本文提供了一种预测电解质中溶解度的新的有效方法,并揭示了一些与非电解质相关的结构特征。