Liu Huanxiang, Yao Xiaojun, Zhang Ruisheng, Liu Mancang, Hu Zhide, Fan Botao
Department of Chemistry, Lanzhou University, Lanzhou 730000, China.
Chemosphere. 2006 May;63(5):722-33. doi: 10.1016/j.chemosphere.2005.08.031. Epub 2005 Oct 14.
The heuristic method (HM) and support vector machine (SVM) were used to build the linear and nonlinear quantitive structure-property relationship (QSPR) models for the prediction of the fish bioconcentration factors (BCF) for 122 diverse nonionic organic chemicals using the three descriptors calculated from the molecular structure alone and selected by HM. Both the linear and nonlinear model can give very satisfactory prediction results: the square of correlation coefficient R(2) was 0.929 and 0.953, the root mean square (RMS) error was 0.404 and 0.331, respectively for the whole dataset. The prediction result of the SVM model is better than that obtained by heuristic method, which proved SVM was a useful tool in the prediction of the BCF. At the same time, the HM model showed the influencing degree of different molecular descriptors on bioconcentration factors and then could improve the understanding for the bioconcentration mechanism of organic pollutants from molecular level.
采用启发式方法(HM)和支持向量机(SVM),利用仅从分子结构计算并经HM选择的三个描述符,构建线性和非线性定量结构-性质关系(QSPR)模型,以预测122种不同非离子有机化学品的鱼类生物富集因子(BCF)。线性和非线性模型均能给出非常令人满意的预测结果:对于整个数据集,相关系数R²分别为0.929和0.953,均方根(RMS)误差分别为0.404和0.331。SVM模型的预测结果优于启发式方法所得结果,这证明SVM是预测BCF的有用工具。同时,HM模型显示了不同分子描述符对生物富集因子的影响程度,进而可从分子水平提高对有机污染物生物富集机制的理解。