Department of Chemistry, Indian Institute of Technology Roorkee, Roorkee, UA 247667, India.
Talanta. 2011 Jan 15;83(3):1014-22. doi: 10.1016/j.talanta.2010.11.017. Epub 2010 Nov 11.
Quantitative structure-retention relationship (QSRR) models correlating the retention times of fatty acid methyl esters in high resolution capillary gas chromatography and their structures were developed based on non-linear and linear modeling methods. Genetic algorithm (GA) was used for the selection of the variables that resulted in the best-fitted models. Gravitational index (G2), number of cis double bond (NcDB) and number of trans double bond (NtDB) were selected among a large number of descriptors. The selected descriptors were considered as inputs for artificial neural networks (ANNs) with three different weights update functions including Levenberg-Marquardt backpropagation network (LM-ANN), BFGS (Broyden, Fletcher, Goldfarb, and Shanno) quasi-Newton backpropagation (BFG-ANN) and conjugate gradient backpropagation with Polak-Ribiére updates (CGP-ANN). Computational result indicates that the LM-ANN method has better predictive power than the other methods. The model was also tested successfully for external validation criteria. Standard error for the training set using LM-ANN was SE=0.932 with correlation coefficient R=0.996. For the prediction and validation sets, standard error was SE=0.645 and SE=0.445 and correlation coefficient was R=0.999 and R=0.999, respectively. The accuracy of 3-2-1 LM-ANN model was illustrated using leave multiple out-cross validations (LMO-CVs) and Y-randomization.
建立了脂肪酸甲酯在高分辨毛细管气相色谱中的保留时间与其结构的定量构效关系(QSRR)模型,这些模型基于非线性和线性建模方法。遗传算法(GA)用于选择变量,以得到最佳拟合模型。在大量描述符中选择了重力指数(G2)、顺式双键数(NcDB)和反式双键数(NtDB)。所选描述符被视为具有三种不同权重更新函数的人工神经网络(ANNs)的输入,包括 Levenberg-Marquardt 反向传播网络(LM-ANN)、Broyden、Fletcher、Goldfarb 和 Shanno 拟牛顿反向传播(BFG-ANN)和共轭梯度反向传播与 Polak-Ribiére 更新(CGP-ANN)。计算结果表明,LM-ANN 方法比其他方法具有更好的预测能力。该模型也成功地通过了外部验证标准的测试。使用 LM-ANN 的训练集的标准误差为 SE=0.932,相关系数为 R=0.996。对于预测集和验证集,标准误差分别为 SE=0.645 和 SE=0.445,相关系数分别为 R=0.999 和 R=0.999。通过多次留一交叉验证(LMO-CVs)和 Y-随机化,展示了 3-2-1 LM-ANN 模型的准确性。