Xue C X, Zhang R S, Liu H X, Liu M C, Hu Z D, Fan B T
Department of Chemistry, Lanzhou University, Lanzhou 730000, China.
J Chem Inf Comput Sci. 2004 Jul-Aug;44(4):1267-74. doi: 10.1021/ci049934n.
The support vector machine (SVM), as a novel type of learning machine, for the first time, was used to develop a Quantitative Structure-Property Relationship (QSPR) model of the heat capacity of a diverse set of 182 compounds based on the molecular descriptors calculated from the structure alone. Multiple linear regression (MLR) and radial basis function networks (RBFNNs) were also utilized to construct quantitative linear and nonlinear models to compare with the results obtained by SVM. The root-mean-square (rms) errors in heat capacity predictions for the whole data set given by MLR, RBFNNs, and SVM were 4.648, 4.337, and 2.931 heat capacity units, respectively. The prediction results are in good agreement with the experimental value of heat capacity; also, the results reveal the superiority of the SVM over MLR and RBFNNs models.
支持向量机(SVM)作为一种新型学习机,首次基于仅从结构计算得到的分子描述符,用于开发包含182种不同化合物的热容量定量结构-性质关系(QSPR)模型。多元线性回归(MLR)和径向基函数网络(RBFNNs)也被用于构建定量线性和非线性模型,以便与支持向量机得到的结果进行比较。由MLR、RBFNNs和支持向量机给出的整个数据集热容量预测的均方根(rms)误差分别为4.648、4.337和2.931个热容量单位。预测结果与热容量的实验值吻合良好;此外,结果还揭示了支持向量机相对于MLR和RBFNNs模型的优越性。