Fatemi Mohammad Hossein, Ghorbannezhad Zahra
Laboratory of Chemometrics, Faculty of Chemistry, University of Mazandaran, Babolsar, Iran.
J Chromatogr Sci. 2011;49(6):476-81. doi: 10.1093/chrsci/49.6.476.
In the present work, the quantitative structure-retention relationship (QSRR) was used to predict the gas chromatographic retention factors of some organic nucleuphile on chemically modified stationary phase by complexes of Cu (II) with amino groups. The gravitation index, relative negative charge surface area, C component of moment of inertia and weighted negative charged partial surface area are selected as the most relevant descriptors from the pool of descriptors. These descriptors were used for developing multiple linear regression (MLR) and artificial neural network (ANN) models as linear and nonlinear feature mapping techniques. The root mean square errors (RMES) in calculation of retention factors for training, internal and external test set are 0.242, 0.295, and 0.240, respectively for MLR model, and for ANN model the RMSE for training, internal and external test set are; 0.084, 0.108, and 0.176. The ANN and MLR model were further examined by cross validation test, which obtained statistics of Q2 = 0.82 and SPRESS = 0.22 for MLR model and Q2 = 0.97, SPRESS = 0.07 for ANN model. Comparison between these results and other statistics of ANN and MLR models revealed the superiority of ANN over MLR model.
在本研究中,采用定量结构-保留关系(QSRR)来预测一些有机亲核试剂在铜(II)与氨基形成的配合物修饰固定相上的气相色谱保留因子。从众多描述符中选择引力指数、相对负电荷表面积、惯性矩的C分量和加权负电荷部分表面积作为最相关的描述符。这些描述符用于开发多元线性回归(MLR)和人工神经网络(ANN)模型,作为线性和非线性特征映射技术。对于MLR模型,训练集、内部测试集和外部测试集保留因子计算中的均方根误差(RMES)分别为0.242、0.295和0.240,对于ANN模型,训练集、内部测试集和外部测试集的RMSE分别为0.084、0.108和0.176。通过交叉验证测试对ANN和MLR模型进行了进一步检验,MLR模型得到的统计量为Q2 = 0.82和SPRESS = 0.22,ANN模型的Q2 = 0.97,SPRESS = 0.07。这些结果与ANN和MLR模型的其他统计数据的比较表明,ANN优于MLR模型。